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Foreword 


The present collective volume is conceived of as the ideal continuation of 
my monograph Digital Papyrology, which indeed appeared as Volume I 
with the same publisher. The two volumes are part of a project initially 
named Beyond the Apparatus intended to frame past and current issues 
surrounding the digital tools and methods that are being applied to 
papyrological research and scholarship. In the monograph, I tried to sketch 
the general outlines of electronic resources (bibliographies and 
bibliographical standards, metadata catalogues, virtual corpora, word lists 
and indexes, digital imaging processes, digital palaeography, information 
media, quantitative analyses, integrated workspaces, textual databases) in 
an attempt to define Digital Papyrology as a self-standing discipline that 
deals with meta-papyri, i.e. papyrus texts in the digital space. Accordingly, 
I argued that the ultimate purpose of Digital Papyrology is the digital 
critical edition of papyrus texts. The goal of the present volume is precisely 
to investigate this purpose, from the multifaceted viewpoints of the most 
advanced trends and projects in the field: namely, the deployment of 
platforms suitable for the encoding of proper digital critical editions of both 
documentary and literary Greek papyri and the development of quantitative 
analysis methods for the evaluation of the linguistic features of the texts. 

In this challenge, I owe gratitude to my international colleagues and 
friends who have enthusiastically accepted to contribute with their 
invaluable experience in the field: in a rigorous alphabetical order, Rodney 
Ast (Heidelberg), one of the leaders of the Digital Corpus of Literary 
Papyri (whom I wish to thank for a linguistic revision of this Preface); 
Lajos Berkes (Berlin), member of the Papyri.info editorial board and 
author of several born-digital editions of documentary papyri; Isabella 
Bonati (North-West University, Pochetsfroom, South Africa), soul of the 
lexicographical project Medicalia Online; Giuseppe Celano (Leipzig), co- 
editor of The Ancient Greek and Latin Dependency Treebank, with his 
long-standing experience in treebanking and morphological annotation of 
classical texts; Holger Essler (Wiirzburg), DCLP partner and architect of 
digital projects about linguistic annotation (the Annotated Philodemus), 


image alignment and automated character recognition in the Herculaneum 
papyri (Anagnosis); Massimo Magnani (Parma), who kindly agreed to 
bring a brilliant classical philologist’s viewpoint to the evaluation of the 
issue at stake; Joanne Stolk (Ghent), co-editor of the Trismegistos database 
of Text Irregularities, with her strong experience in linguistic variation in 
the papyri and its digital treatment; Marja Vierros (Helsinki), who launched 
(and manages) the pathbreaking platform Sematia aimed at facilitating 
linguistic annotation of the papyri. 

On my side, I wish to acknowledge the fact that the volume stems from 
the project “Online Humanities Scholarship — A Digital Medical Library of 
Ancient Texts” (DIGMEDTEXT: §http://www.papirologia.unipr.it/ERC), 
funded by the European Research Council (Advanced Grant Agreement no. 
339828) at the University of Parma (2014-2016) and directed by Professor 
Isabella Andorlini, to the grateful memory of whom this volume is 
dedicated. This statement is not a matter of pure bureaucracy. The 
DIGMEDTEXT project, primarily aimed at creating a database of the 
Greek medical texts on papyrus, has been the breeding ground for more 
general — theoretical, methodological, and technical — reflections about 
linguistic papyrological phenomena and their electronic treatment, as well 
as about the digital critical edition of the papyri themselves. It is my hope 
that the entire papyrological community, and in general all scholarship 
interested in such topics, will enjoy the results reached in the past years, 
and that discussion and development may continue further in the future. 


Parma, January 10, 2018 
Nicola Reggiani 
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Fig. 1: PSI XV 1510, medical catechism on anatomy, III cent. AD: printed 
edition and digital edition at http://litpap.info/dclp/64024. 
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A very tentative example of annotation of a medical papyrus: GMP II 10. 

ID = Identifier; T = Tokenization; PoS = Part of Speech: H = Syntactic head: R = Syntactic relation [H + R + ID = 
Syntactical annotation]; N = Normalization, E = Kinds of errors [VE = vowel exchange; BD = broken diminutive; ML 
= missing letter, EC = error corrected; CE = consonant exchange]; L = Lemmatization, TT = Technical terms; A = 
Abbreviations; M = (Dia)critical marks and punctuation [The other levels of annotation do not apply to this instance. 
Note that PoS and syntactical taggings are simplified for reason of space]. I am very grateful to Dr. Giuseppe G.A. 
Celano for his essential help in the grammaticallinguistic annotation. 
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The Corpus of the Greek Medical Papyri 
and a New Concept of Digital Critical 
Edition 


The present contribution is published in the framework of the Project “Online 


Humanities Scholarship: A Digital Medical Library Based on Ancient Texts” 
(DIGMEDTEXT, Principal Investigator Professor Isabella Andorlini), funded by 
the European Research Council (Advanced Grant no. 339828) at the University 
of Parma (http://www.papirologia.unipr.it/ERC). 


1Defining and shaping a digital critical edition 


Traditionally and basically, a critical edition of a text is the printed output 
of a philological work, i.e. the process of reconstruction of a textual 
archetype (the ‘source’) among different variants, aimed at reproducing the 
original text as most exactly as possible, or, in other terms, as the fixed 
representation of a scholar’s more or less trustable opinion on that text. 
Accordingly, and rather intuitively, a digital critical edition should be 
defined as the digital output of a philological work. We will see what a 
“digital output” involves in methodological and epistemological terms but, 
to start, it must be noted that traditionally a digital critical edition is 
regarded as the digital transfer of a printed critical edition. Sometimes, this 
process regretfully gets rid of the attribute ‘critical’, so that we have digital 
editions or textual corpora deprived of apparatus criticus and therefore 
‘uncritical’, as in the well-known cases of the Thesaurus Linguae Graecae 
or of the Perseus Digital Library. This treatment presents encoding 
advantages, since one reference edition is chosen and digitized, but also 
huge disadvantages in terms of usability, because search and analysis 
functions are limited to the chosen text, without consideration, e.g., for 
textual variants, alternatives or different editorial solutions.1 Somewhat 
‘hybrid’ editions try to save the constitutio textus (the restitution of a text as 
close as possible to the supposed original) alongside the recording of 
variant readings: for example, the former Duke Databank of Documentary 
Papyri with the spelling variants (as written on the original papyrus) 
embedded within the ‘normalized’ text with special markup.2 A fairer 
transfer process preserves the apparatus criticus, which is usually displayed 
in a way that resembles the printed edition. The simplest examples are PDF 
editions (either scans of paper samples or born-digital files like the 
publications of the PHerc project),3 the most articulated ones are the digital 
editions available at the Papyri.info platform, where critical annotations, 
encoded as inline XML markup elements, are processed and displayed in 
an apparatus of print-like format, stressing the distance between the 
‘correct’ text and alternatives, variants, actual textual features. 

This traditional view is being challenged as rather uncomfortable by the 


development of digital technologies in the ancient studies, as well as by an 
increasing concern for the actual testimonies and the process of textual 
tradition: we may define it as a sort of ‘phenomenological’ approach. 
Digital projects like the Homer Multitext Project (HMT) or the Leipzig 
Open Fragmentary Texts Series (LOFTS) started envisaging a different 
approach to textual criticism, in deploying a text that is in fact a multitext, a 
fluid and dynamic network of multiple editions aligned to each other (by 
means of a URN architecture) rather than a traditional fixed structure of 
text and apparatus criticus,4 In this framework, the uneasiness of texts that 
are felt not being completely suitable for a ‘traditional’ critical edition (e.g. 
oral Homeric poetry,s fragmentary sourcess) merges with the new 
capabilities of digital infrastructures, which offer much more dimensions 
than printed paper. Hypertext is a new writing space, to which editors have 
to adapt the texts:7 


[o]nce we are able to overcome the physical limits of printed editions by joining 
together variants and conjectures referring to the same texts, it also becomes 
possible to look at the texts from a new and broader perspective, with possible 
consequences for our knowledge and comprehension of them.s 


Thence, an unavoidable fact: 


[w]e need to move in the direction of digitally conceived and initiated types of 
information and away from mopping up information from print sources.9 


As it has been put very effectively, the hypertext architecture is challenging 
the Urtext model,io and it paves the way for exploring the possibilities of 
“holistic” models where editorial choices are superseded by an interactive 
network of all extant data, with potentially infinite information layers.11 
Perhaps, the model that better describes this ideal condition is an ontology 
design: 


an ontology is the most suitable solution to represent critical editions of ancient 
texts for two main reasons: first, we want to be able to link different kinds of 
resources [...] that have in common the possibility of being referred to via 
URIs, which is one of the principles of the Semantic Web; second, information 
contained in critical editions constitutes a layer of interpretation and a 
description of relations about texts that is important to keep clearly distinct from 
the texts themselves. Indeed, the use of stand-off metadata encoded within 
ontology allows us to express an open-ended number of interpretations, whereas 
a markup-based solution would not make this possible due to obvious reasons of 
overlapping hierarchies. 12 


2Papyrology: philology in flux 


Papyrology is, in its more essential core, all about providing trustable 
critical editions (and commentaries) of papyrus texts.13 Though projected 
towards a broad historical and cultural evaluation of the textual data, 14 it is 
intimately a philological discipline:15 no one can deny that without texts 
there would exist no Papyrology. Yet it is a very peculiar philological 
discipline, since it is well aware of the fluidity of its objects of study: 16 
texts are continuously published, updated, collected, revised, corrected, 
emended, republished, and there is hunger for resources that can help 
handling an overwhelming amount of primary data.17 It is — to borrow the 
successful concept that Zygmunt Bauman launched to emphasize the fact 
of change in the modern timesis — a ‘liquid’ philology, for which digital 
environments seem extremely fitting; in particular, collaborative platforms 
like SoSOL seem the most suitable incarnation of this complex and fluid 
editorial workflow. 19 

Moreover, Papyrology has always been facing an adventurous textual 
situation, having to cope with fragmentary and unique texts and 
idiosyncratic utterances, and has developed a remarkable interest in the 
scribal and material phenomenology of textual features and transmission, 
which affects consistency in treating the wide series of textual fluctuations 
occurring in the papyri. Indeed, while philological analysis would gladly 
treat fluctuations as deviations from a standard archetype (i.e. mistakes or, 
more gently, variants) and normalize them in a reconstructed critical 
edition, they actually bear significant socio-cultural relevance and are of 
fundamental importance from the viewpoint of the phenomenology of the 
papyrus texts, its interpretation, and ancient writing culture in general. In 
other words, very often fluctuations are not used to reconstruct a text but to 
investigate relevant socio-cultural phenomena. Accordingly, — the 
papyrologists’ behaviour towards such textual flavours is twofold, and 
generates a wide variety of editorial inconsistencies that affect printed 
editions as well as digital databanks. 

As to the latter, the issue at stake is not only critical agreement or 
scholarly standards, but also (as hinted above) the usability of the tools 
themselves, in terms of searching and encoding. The best example, from 
my own experience, is the case of the word épunveia, which often occurs in 
the papyri in the iotacistic form éppnvia. The spelling ‘variant’ is treated 
differently in the printed editions, being sometimes ‘regularized’ in the 
apparatus, sometimes not, generating textual inconsistencies even within 
the very same text.20 In BGU I 326, ii 15 éppnvia is printed without 


apparatus notes, and it is reproduced in the databank as such; in the same 
text, at 1. i 1 the same word is supplied as épunveila (following the 
‘standard’ form) in the database, while all the printed editions (after the 
ed.pr.. Chr.M. 316; Sel.Pap. I 85; FIRA? II 50; Jur.Pap. 25) keep the 
‘variant’ in the lacuna too. Another ‘classical’ case is that of the verbal 
forms of yiyvopo1, which becomes y()ivowo1 in the Koine Greek.21 The 
latter forms are indeed treated as the standard in most of the papyrus 
editions, and therefore are not ‘regularized’ as variants,22 but this is not 
always consistent: editorial regularizations do occur, seemingly only when 
the verb is affected also by iotacism, often in compounds.23 On the other 
hand, we do find the classical Greek forms not being regularized as well,24 
which increases the uneasiness of anyone who would like to perform 
effective searches in the digital textual corpora. With the further 
developments of the Greek language, the situation is even more complex: 
for example, the general shift from dative to genitive in the later 
(Byzantine) instances of the language of the papyri2zs leads to further 
editorial inconsistencies. In BGU XIII 2332,20 (AD 375), for instance, 
vnapyo + genitive (uov) is regularized in dative (wou) according to the 
classical use,26 whereas in SB XVII 13947,15 (AD 507) badpyo + dative 
(uot) is regularized in genitive (ov) as if the latter was then the correct 
form.27 One must be aware of any possible spelling or syntactic 
combination to perform trustable textual searches.28 

As is apparent, papyrus texts carry a cognitive complex that is often 
hard to fit into printed editions and may find its better representation in the 
digital space, where the objects of study undergo a process of 
dematerialization. I have already argued that the development of Digital 
Papyrology, in its treatment of computerized information about papyri, 
produced the effect of working on the virtual representation (avatar) of the 
papyri themselves, which turn to be meta-texts,29 in the terms already 
envisaged by Traianos Gagos as early as 1998: 


In this new era of papyrological research, we cannot speak of a collection of 
papyri alone, but also of a collection of electronic files, data, metadata and 
digital images:30 

The availability of huge amounts of information in fully searchable textual form 
with accompanying images through these new media is altering drastically the 
definition of what constitutes a ‘text’, the way we experience reading it and, 
ultimately, the plurality of messages a text can offer to one or more readers. The 
new methods of presenting text with marked up images and the simultaneous 
availability of a variety of other research tools within the same electronic 
environment give us new ways of visualizing and approaching a given text. An 


edited text is no more a static, isolated object, but a growing and changeable 
amalgam: the image allows the user to look critically at the ‘established’ text 
and to challenge continuously the authoritative readings and interpretation of its 
first or subsequent editors. 

Furthermore, the simultaneous access to and study of thousands of texts and 
their images that could be as far apart as a millennium, in a single search and 
through the same medium, has the potential to challenge our established notions 
of the ‘messages’ a text carries within itself, its textuality and intertextuality 
[...]. As Roland Barth [sic] explains: ‘Any text is an intertext; other texts are 
present in it, at varying levels, in more or less recognizable forms: the texts of 
the previous and surrounding cultures. Any text is a new tissue of past citations. 
Bits of codes, formulae, rhythmic models, fragments of social languages, etc. 
pass into the text and are redistributed within it, for there is always language 
before and around the text’. In one or another way, papyrologists have always 
recognized the “intertextuality” of the Greek papyri from Egypt, because of the 
multicultural and multi-ethnic environment in which these texts were born. The 
development of the new electronic media in our field and the capability to 
establish these cross-links — or these intertextual signifiers, so to speak — on the 
linguistic, cultural and historical level through the interaction of multiple texts, 
images and a variety of related tools places the notions of textuality, 
intertextuality and metatextuality on a new (electronic) platform which, in turn, 
becomes part of these notions as the ‘carrier’, ‘interpreter’ and ‘distributor’ of 
these texts.31 


The concept that Digital Papyrology redefines the notion of ‘papyrus’ is 
embedded in the consideration that 


these media, when used within a wider intellectual perspective as a cognitive 
tool for research and instruction and not only as a pragmatic medium that can 
‘do certain things for us’, can challenge and redefine notions of ‘text’ and 
textuality.32 


After realizing that we are coping with enhanced papyri that are in fact 
“meta-papyri’, we need to reshape the digital edition in accordance with the 
nature of the papyrological digital data as autonomous intellectual objects 
(following the definition of what is ‘data’ for the humanists according to 
OWENS 2011),33 and the possibilities offered by the electronic meta- 
space.34 There is a momentous chance to see the digital document not as 
the mere, more or less complete reproduction of a printed critical edition, 
but as a quantum particle of a fluid universe of text transmission. This 
‘dispositive’ — in foucaldian terms3s — may find a suitable representation 
through the abovementioned ontology design, where we do not have to 
decide what is ‘regular’ or ‘normal’ and what is a ‘secondary’ reading, but 


can create an interconnected network of aligned versions, which represent 
different possible layers of textuality:36 


[o]nly with a comprehensive understanding of the content and assumptions of 
the traditional hughly-evolved critical apparatuses will we make the right 
strategic decisions for the future of textual scholarship.37 


Philology tends to overcome any textual fluctuation in favour of a 
reconstructed text that be as closest as possible to the ‘original’ source, but 
documentary papyri are actually the original source of themselves (any 
critical interventions being configured as the reconstruction of an 
imaginary archetype), while literary and paraliterary papyri present more 
complex issues, as introduced below. They are therefore among the best 
text typologies suitable for exploring new ways of conceiving digital 
critical editions. 


3The medical papyri: special technical needs of 
a special technical corpus 


Within the framework sketched above, the Digital Corpus of the Greek 
Medical Papyri project3s proved pathbreaking in applying the notion of 
digital edition to literary and paraliterary papyri, previously excluded from 
Papyri.info and object of very specific and isolated projects (CPP, THV 
etc.). The project stemmed from Isabella Andorlini’s lifelong interest in the 
medical papyri and from her own challenge to collect them in a uniform 
and homogeneous corpus.39 Her first steps went towards the printed 
medium,4o but she soon realized the strong potentials of Papyri.info to host 
dynamic papyrological editions,41 and her project later became one of the 
leading pilot test cases of the rising Digital Corpus of Literary 
Papyrology42 in envisaging new technical and theoretical strategies for the 
encoding of literary and paraliterary texts, eventually awarded with an ERC 
advanced grant (http://www.papirologia.unipr.it/ERC). 

Medical papyri are technical texts: they have been conceived to convey 
a technical knowledge, i.e. theoretical and practical specialized information 
at the same time — a knowledge that is, in turn, mirrored and refracted in 
the different written genres encompassed by the corpus.43 The importance 
of medical technical skills is apparent, and not only for health reasons 
(think of Galen’s instructions to the patients so that they can choose the 
best doctor after an enquiry on his skills):44 one might recall P.Oxy. I 40 (+ 


BL1312, V 74, VI 95; Oxyrhynchus, II cent. AD), a copy of the report of a 
court judgement where a public doctor claims for immunity from some 
liturgies, and the judge, after a rather witty remark, requests a scientific 
proof of his assertion.45 The importance of written text for this education is 
stressed as earlier as in the Hippocratic corpus: “I consider the ability to 
evaluate correctly what has been written as an important part of the art” — 
says the author of the Epidemics — “He who has knowledge of it and knows 
how to use it will not commit, in my opinion, serious errors in the 
professional practice” (Epid. Il 16 = II 10,7 ff. L.). In fact, the 
transmission of this knowledge was carefully carried out through a 
specialized education, which was based on oral teachings later entrusted to 
written supports. In the introduction to the treatise On his own books, Galen 
himself explains how in the context of the oral lesson one used to take 
written notes, thence moving to the publication of memoranda, the 
hypomnemata of the lessons heard.46 

Stemming from both the knowledge of oral teaching and the know-how 
of practical records and individual experience, every medical writing is not 
a fixed book but a tool in flux: the older treatises are annotated, 
commented, collated often against annotated and commented copies,47 
transcribed with additions, corrections, and updates; the collections of 
personal notes on clinical cases, therapies or remedies are constantly 
revised on the ground of practice; prescriptions are transcribed, exchanged, 
collected, gathered in the receptaria and passed down; handbooks of 
different typologies are used to teach again, and so on, keeping on the 
written support traces of every stage of transmission and use.48 

Such texts are further characterized by intertextual and transtextual 
connections: references, quotations, more or less literal parallels, are 
another key to understand and contextualize the matter at the best. The 
recipes, and their collections known as receptaria, find inspiration in the 
pharmacological treatises and are further enriched by the doctors’ personal 
practice and by references and quotations from different medical sources; 
the questionnaires are connected to the literary tradition of the Definitiones 
medicae (see below). These are by no means stemmatological relationships 
between ascendants and descendants: it is a fluid knowledge undergoing 
continuous changes, updates, adaptations, much influenced by oral teaching 
and actual practice. Accordingly, the very textual data interweave with a 
huge panel of textual devices, which contribute to articulate an expressive 
network that is essential to the medical writing itself, to its transmission, to 
its learning, and to its practical use: therefore, they deserve a particularly 
careful consideration. Critical and diacritical marks, punctuation, graphical 


and layout features, technical terms and formulae, literary or sub-literary 
references or echoes, marginal annotations — to cite the most outstanding 
devices — form a complex interplay that cannot be separated from the text 
itself, nor — even more — ignored, without compromising the correct 
interpretation of the evidence. Rigid definitions of philological variants do 
not really apply, as well as the treatment of linguistic variants can be more 
complex than the simple application of regularization markup tags, which 
categorize a ‘standard’ (not to say ‘correct’) and a ‘deviant’ version of a 
word.49 

The inadequacy of the traditional philological/stemmatological model 
to represent in full the textual features of these complex and fluid technical 
writings has already been pointed out by Ann Hanson,50 who advanced an 
“accretive model of composition” to provide a suitable description of the 
phenomenon. In David Leith’s words, 


[t]he textual tradition of compilations of this sort was highly fluid, and we 
should not conclude that they represent exactly the same text.s1 


Digital tools offer now the best solutions to face this challenge, which 
Isabella Andorlini herself envisioned at the very beginning of the Corpus 
dei Papiri Greci di Medicina project, a primary focus of which was 


la nuova attenzione rivolta alle problematiche editoriali dei testi studiati nella 
complessita dei rapporti con le fonti rispetto alla tradizione conosciuta.52 


4Envisioning digital critical editions of (medical) 
papyri 


As I anticipated above, a digital critical edition can be defined as the digital 
output of a digital philological work. This has a vital outcome in terms of 
data encoding. Indeed, encoding data involves a digital critical workflow 
that takes the features of the original texts (and of the printed editions) to 
adapt them to the digital medium. It requires a thorough philological work, 
namely a digital philological one, where the digital papyrologist is — to 
paraphrase Youtie’s well-known definition — an artificer of data (in the 
abovementioned, intellectual meaning of ‘data’). Any information taken 
from the text or from previous editions becomes data (or metadata, i.e. 
‘data about data’); and even when encoding a print-published edition, one 
should check carefully the original text to avoid possible inconsistencies 


and ambiguities inherited by the previous editors, so that the ‘liquid’ 
editorial flux goes on. 
Moreover, 


[e]ncoding fragments is first of all the result of interpreting them, developing a 
language appropriate for representing every element of their textual features, 
thus creating meta—information through an accurate and elaborate semantic 
markup. Editing fragments, therefore, signifies producing meta—editions that 
are different from printed ones because they consist not only of isolated 
quotations but also of pointers to the original contexts from which the fragments 
have been extracted. On a broader level, the goal of a digital edition of 
fragments is to represent multiple transtextual relationships as they are defined 
in literary criticism [...]. Designing a digital edition of fragments also means 
finding digital paradigms and solutions to express information about printed 
critical editions and their editorial and conventional features. Working on a 
digital edition means converting traditional tools and resources used by scholars 
such as canonical references, tables of concordances, and indexes into machine 
actionable contents.53 


Therefore, “encoding a text is an interpretive act’s4 by itself: on the one 
hand, the encoder (the digital papyrologist) must employ as much criticism 
and careful discernment as possible in order to give the papyrological 
object its correct digital representation. On the other hand, one must be 
aware of the fact that the digital medium has different requirements than 
the printed one. While philology is a way of describing a text to interpret it 
as a Stable source, a phenomenological approach is a way of representing a 
text in all its components, to describe and understand the underlying 
semantics. Therefore, when we choose to overcome the inadequacies of a 
traditional critical edition in favour of the digital multi-space, we must keep 
in mind the following three fundamental requirements: 
—standardization (adapting to the digital medium means to follow its strict 
rules);55 
—semantic representation (which may differ from the traditional 
philological representation, as we will be noticing below); 
—usability (in terms of data access, searching and developing options). 


Data and metadata can be encoded and used as different, yet interconnected 
(aligned) information layers.s6 An XML annotation markup seems to be the 
best encoding strategy, since it has a consolidated background in the TEI/ 
EpiDoc system that has already been adapted to the papyrological 
requirements,57 providing a standardized and standardizing framework, a 
semantic annotation, and powerful search options through XPath and 


XQuery querying languages.5s It also allows for any kind of final rendering 
by means of customizable transformation languages (XSLT). Alignment 
among layers can be achieved by deploying a CTS URN architecture, 
which is useful to give unique identifiers to each element and to avoid 
overlapping hierarchies, especially in linguistic annotation. Annotated 
layers can be stored in a GIT repository so that open access and 
collaboration are granted. Some layers already exist in the SoSOL 
infrastructure (metadata, introduction and commentary, translation, 
annotated text); more can be envisioned, for example, on the ground of 
Gérard Genette’s textual theory, which describes all possible relations 
among texts and which has already been claimed as the privileged 
interlocutor of the complex textual ‘dispositive’ of papyrus texts.59 

Without any presumption of emulating Herbert C. Youtie, who 
provided the standard outline of a ‘canonical’ papyrus edition,so what 
follows is an attempt of systematizing the extant strategies for encoding a 
digital papyrus edition, with some suggestions for possible further 
improvements. The past work on the medical papyri provided the most 
complex and intriguing cases, but the same recommendations can apply to 
simpler cases too, as well as to documentary papyri of any sort. 


4.1Metadata and bibliography 


Papyrus metadata (i.e. contextual information about texts: chronology, 
provenance, etc.) are currently stored in digital catalogues like the 
Heidelberger Gesamtverzeichnis (HGV) for the ‘documentary’ texts, the 
Leuven Database of Ancient Books (LDAB) and the Mertens-Pack3 (M-P3) 
for the ‘literary’ ones, Trismegistos (TM) for both, and some more 
specialized ones like the Corpus of Paraliterary Papyri (CPP) or 
Synallagma. Similarly, digital bibliographical repositories exist, namely the 
Bibliographie Papyrologique (BP) and Trismegistos bibliographies.61 
Papyri.info and the DCLP currently include metadata from (respectively) 
HGV + TM and from LDAB, 62 and point to BP records as well as to some 
more resources (e.g. Synallagma). A digital critical edition of medical 
papyri should of course extend this feature to include the Mertens-Pack? (in 
its specific Medici et medica section)63 and possibly to envision some 
digital version of Marganne’s and Andorlini’s printed catalogues of medical 
papyri.64 Contextualization is indeed fundamental: 65 


[I]o studio del manufatto e una sua corretta collocazione cronologica sono 


informazioni essenziali, che possono interferire con le ipotesi di attribuzione dei 
contenuti, sia per il rapporto con gli autori noti, sia per un’adeguata 
impostazione dell’indagine sulle fonti e sugli anelli della tradizione indiretta. La 
provenienza del reperto papiraceo pud, nei casi in cui gli elementi archeologici 
siano conosciuti, conservare dati preziosi sul contesto in cui inserire le farine di 
produzione libraria antica, e sui livelli della sua divulgazione in Egitto (centri di 
diffusione legati alle vie dell’insegnamento e della pratica della disciplina; 
biblioteche templari, scuole mediche specializzate): l’attenzione ai luoghi 
accertabili di ritrovamento dei reperti ci permette di delineare il milieu culturale 
in cui libri di questo genere furono prodotti, o semplicemente letti, da fruitori 
professionisti e da gente colta con qualche interesse per i temi della salute. 6« 


Moreover, the standardizing potential of digital metadatao7 would be a nice 
ground to deal with the problem of the definition of textual genres or 
typologies, and to face the challenge of a categorization, an issue that is 
well framed — from the medical viewpoint, in the wake of Isabella 
Andorlini — by Francesca Bertonazzi in the following words. 


Classificare 1 papiri per tipologia non é solo un mero esercizio erudito 0, peggio, 
sterilmente matematico nel senso deteriore del termine: al contrario si configura 
come un’indagine che puo gettare luce sul contesto di composizione e d’uso del 
testo, e non di rado pud agevolare la sua ricostruzione filologica e 
l’interpretazione esegetica. L’attivita non é priva di rischi: un primo problema é 
sottolineato da quanti mettono in guardia dalla rilevanza statistica dei dati che 
possono essere desunti dai papiri, che sono inevitabilmente vincolati ai 
ritrovamenti, al tipo di descrizione fornita dal primo editore, dal tipo di 
classificazione operata nei primi studi sul testo. Il primo ostacolo, per cosi dire, 
é dunque di natura extratestuale, ovvero risiede nella mera quantita di papiri 
appartenenti a una data tipologia: anche se la maggior parte dei papiri medici 
afferissero al genere, e.g., del trattato, non per questo si dovrebbe concludere 
che il trattato fosse il genere pit praticato in ambito medico nell’Egitto greco- 
romano. Un secondo problema, di tipo intratestuale, risiede nella tipologia 
stessa del documento, che spesso non appartiene in modo netto all’uno o 
all’altro tipo di testo: “Chi si ¢ occupato anche solo marginalmente della 
interpretazione di frammenti di papiro a contenuto ‘medico’, avra constatato 
come una delle difficolta pit evidenti ¢ quella del riconoscimento e della 
definizione del genere testuale, del tipo di opera cui appartennero brani parziali 
di scritti oggi in larga parte perduti. Una difficolta dovuta, oltre che alla 
casualita e alla precarieta del reperto papiraceo, anche alla organizzazione stessa 
delle opere a contenuto medico, teorico o specialistico che fosse: il 
riconoscimento di soggetti e termini medici é da solo insufficiente per dirci 
qualcosa di piu preciso sull’impostazione dell’opera originaria, in quanto le 
singole nozioni tecniche ricorrevano in settori diversi della disciplina, e 
potevano essere esposte o discusse a livelli di approfondimento e di 


concettualizzazione anche molto distanti tra loro” [ANDORLINI 1997b, 159]. 

In quest’ ottica, lo studio del corpus offre alcuni casi interessanti di testi a mezzo 
tra una o l’altra tipologia (come P.Oxy. 2.234 + 52.3654,92 tra il catechismo e 
la raccolta di prescrizioni), oppure di informazioni testuali insufficienti a 
distinguere con precisione l’appartenenza tipologica (come in P.Oxy. 74.4973: il 
testo potrebbe riguardare la veterinaria come la fisiognomica), o ancora di testi 
che pur rientrando nella categoria ‘lettera’, possono avere natura documentaria 
(come MPER 13.6 e GMP 2.10, lettere redatte da medici, e P.Mert. 1.12, lettera 
a un medico) oppure letteraria (POxy. 9.1184 raccoglie varie lettere di 
Ippocrate). 

Un terzo problema, di ordine linguistico, risiede nella terminologia moderna 
utilizzata per classificare i testi: non di rado si € avvertita la necessita di 
puntualizzare le varie accezioni di ‘etichette linguistiche’ attribuite a generi 
antichi: “[nJel classificare la ricettazione nei papiri ho volutamente differenziato 
Puso del termine ‘prescrizione’ (applicato a medicine complete di indicazione 
terapeutica, norme estese alla preparazione e all’uso dei rimedi), da quello di 
‘ricetta’ (applicato a formule assai semplificate, limitate all’indicazione dei 
componenti, attestate anche singolarmente su foglietti di papiro ed ostraca). 
Con ‘prescrizione’ e ‘ricetta’ identifico percio tipologie leggermente differenti 
di testi. Definisco col termine ‘ricettario’ un testo poco elaborato formalmente, 
che raccoglie ricette 0 prescrizioni; con ‘manuale terapeutico’ intendo uno 
scritto in cui si riconosce un’organizzazione compositiva e formale piu 
complessa, ora prodotto nell’ambito dell’insegnamento della disciplina, ora non 
diverso dai modelli di ‘trattato’ terapeutico” [ANDORLINI 1993, 469-70 n. 
22].68 


4.2Introduction and commentary 


The possibility to add a “front matter” and a “line-by-line commentary” is 
currently allowed by both Papyri.info and the DCLP, though it has been 
poorly exploited so far. Some documentary samples have been produced in 
the framework of the born-digital editions described by L. Berkes in this 
volume; for the DCLP side, see R. Ast and H. Essler ibidem. The DCGMP 
project utilizes systematically this feature to provide a general introduction 
to each text and to record the main textual features that cannot be encoded 
within the text for the moment, namely technical descriptionse9 (see below 
for future integration with the Medicalia Online lexical tool) and parallel 
passages in both other medical papyri and literature (see below for the 
intertextual layer). 

An earlier way of inserting short comment strings (mostly providing 
information about re-editions of the texts) within the inline markup, 


through the <note> tag (Leident: /* */), is possible but definitely not 
exploited nor really recommended (the Leiden+ guidelines warn: “use 


sparingly”!) 


4.3Translation 


Translations of the original text in multiple modern languages are currently 
supported in the existing databases. The DCGMP policy is to produce at 
least an English translation of each text, but when a scholarly translation in 
a different language does exist, the preference is granted to that one. As 
long as translation is a means of interpretation, the possibility to align the 
original text with its translation(s)70 is worth being explored, for instance 
through the Medicalia Online lexical platform (see below). 


4.4Materiality 


The physical appearance of the papyrus is of the utmost importance for the 
papyrologists, who are deeply interested in the material aspect of the 
fragments.71 Size and colour are the first physical features that are indicated 
in a traditional edition, and since they are not recorded in the metadata 
catalogues, they should be indicated in the introductory matter (see above) 
or in new metadata fields. A digital picture could compensate for this, but it 
is not available for all papyri (see below). 

Material features of the writing support are encoded directly in the text 
itself according to the current standards. As to this point, there are some 
notabilia that must be stressed because they slightly differ from the 
traditional editorial practice. Line numbers, for example, are to be indicated 
for each line (contrarily to what happens in most of the printed editions) in 
a standardized way (number-dot-space); words that wrap between two lines 
are indicated with a hyphen after the dot of the second line number, not at 
the end of the first line. Both Leiden+ procedures may seem rather 
unconventional to ‘traditional’ papyrologists, but they are grounded on 
XML requirements: numbers are related to “line break” tags (<lb/>) that 
must open each new line of the encoded text; hyphens represent the 
attribute break="no" in the same <Ib/> tag, meaning that the new line does 
not break the word.72 In the HTML output things are brought back to the 
traditional display (line numbers grouped by five, hyphens at the word 
break). 


Writing sides (recto/verso, folios in codices) and multiple fragments are 
encoded as document divisions (XML <div type="textpart">). This tag 
deploys an n attribute, which expresses the number/letter identifying the 
fragment/folio (or the letters r/v for recto/verso), and a subtype attribute, 
defining the type of part: "fragment", "folio", but also "column" or "part" if 
the text is divided into different layouts or sections even within the same 
writing side. By the way, this is a good way of dealing with texts that are 
composed by several sub-texts, like e.g. collections of letters or recipes.73 
Divs can be nested if needed, and each text block is anyway enclosed by an 
<ab> tag (“anonymous block”). In Leident+, Divs are introduced by the tag 
<D= followed by the said attributes preceded by dot (e.g. <D=.r for recto, 
<D=.1.fragment for fragment 1), the text block by <=. Every text tag must 
be closed at the bottom, paying attention to the correct order (divs are 
opened before <ab> at the beginning, and symmetrically closed at the end). 

The most remarkable physical feature of the papyri is fragmentation. 
This usually results in marginal breaks (printed as rows of dashes at the top 
and/or at the bottom, closed square bracket on the left, open square 
brackets on the right) and in-text gaps (represented as square brackets 
surrounding some indication of the missing text, which may or may not be 
supplemented). They are currently encoded as in-text markup; however, the 
digital concept of gap, according to the TEI/EpiDoc canons, is slightly 
different from the traditional one, and it deserves some comments. Each 
unsupplied break or lacuna is indeed treated as missing text, and all types 
of missing text are handled with the <gap> XML tag, which is used to 
encode both lost and illegible portions of text (the latter being usually 
printed — and are displayed — as series of dots). The <gap> attribute reason 
distinguishes the two cases ("lost" or "illegible", plus "ellipsis" if the text is 
missing because left untranscribed by the editor), while the attribute unit 
specifies if we are dealing with just a number of characters or with entire 
lines. The extension of the missing text is defined by the attributes extent 
("unknown" number of chartacters or lines), quantity (known number of 
characters or lines), atLeast / atMost (approximate range calculation). A 
precision attribute set to "low" indicates the uncertainty of an extension. 
Leiden+ syntax developed around the use of dot, after the print conventions 
of indicating illegible characters by means of dots: a dot followed by a 
number or a range (and by the indication lin when dealing with lines) 
indicates illegible text; the same, but preceded by the indication lost, marks 
lost text. If the dot is preceded by vestig, an element <desc>vestiges</ 
desc> is added to the <gap reason="illegible"> tag, in order to encode 
generic ‘traces’ (which is indeed the HTML output). Untranscribed text is 


marked differently (see below), as are supplied gaps, though from the 
papyrological viewpoint they are actually the same facts as the unsupplied 
ones (see below). 

Unclear characters are another good example of how semantic markup 
differs from traditional print editions. In the latter, any unclear letter is 
marked with an underdot, either with the letter on its top (if legible) or not 
(if illegible). In the digital edition, illegible characters are non-textual 
portions marked with the <gap reason="illegible"> tag described above, 
while unclear but legible characters are text portions marked with an 
<unclear> tag. Leiden+ utilizes the regular Unicode underdot in the latter 
case, while in the former the dot is recalled with a full stop followed by the 
number or range of unclear characters. In the HTML display, they become 
both underdots. 

The close relationship between the text and its support is the core focus 
of the CRMtex project (http://www.cidoc-crm.org/crmtex), which provides 
tools for managing the study and publication of ancient handwritten 
documents and may be taken into consideration for developing new 
strategies in the digital edition of papyrus texts too. 


4.5Palaeography 


Annotating palaeography is a huge task. Beside a general palaeographical 
description of the handwriting, which may well be detailed in the front 
matter, the possibility to mark up each single character is particularly 
tricky. Apart from its extreme intricacy, such a task should be preceded by 
a huge effort to standardize palaeographical terms and descriptions, which 
are notoriously idiosyncratic and inconsistent. Text alignment with the 
digital picture can help: this is precisely the purpose of the Anagnosis 
project, conducted at Wirzburg by Holger Essler, which may eventually 
come to the automatic recognition of the characters.74 

Some visual characteristics of the written text are encoded with an 
appropriate markup that describes the appearance of lines, words or single 
characters with special display features. Lines that are written 
perpendicular or inverse with respect to the main body of the text can be 
encoded with a rend="perpendicular"/"inverse" attribute of the <lb/> tag (in 
Leiden+, this is obtained by putting the line number in brackets, typing a 
comma+space instead of dot+space, and then the appropriate attribute 
value). Similarly, ancient text highlights (taller characters, superscript, 
subscript, supraline, underline) are tagged with a <hi> element, with a rend 


attribute specifying the kind of highlighting (standard values: "tall", 
"superscript", "subscript", "supraline", "supraline-underline"). Leiden+ 
equivalents are shaped in a graphical appearance that hints to the text 
display on the papyrus (respectively: ~||x||~tall; |x’; \|x|/; x ; =x=). It 
must be noted that currently the use of this markup is deprecated when the 
highlighting describes an abbreviation (see below). A text written inside a 
box is encoded with a milestone element (<milestone rend="box" 
unit="undefined"/>; Leiden+: ###).75 

Another palaeographical feature that can be encoded with the current 
markup is the handshift (<handShift new="m2"/>; Leident+: $m2; 
displayed as (hand 2)); for the use of this tag see also Marja Vierros’ 
chapter in the present volume (which — by the way — contains also an 
interesting discussion about palaeographical metadata). 


4.6Text 


At the core of the papyrus fragment, text as a linguistic fact deserves the 
highest and deepest attention, for both the peculiarities of the language of 
the papyri in general and the specific relevance of technical language in 
small corpora like the medical writings.76 Digital annotation is a 
fundamental practice in the linguistic study of a corpus of texts:77 it allows 
to describe, record, interpret and analyse linguistic information at several 
levels, in which each layer corresponds to a particular category of relevant 
information.7s Multiple levels of linguistic annotation of papyrological 
relevance can be outlined.79 


4.6.1Part-of-speech annotation 


The basic annotation layer, related to the analysis of the parts of speech — 
also known as treebank because it is usually represented with a tree graph 
—, would allow to conduct an extensive lexical, phraseological-formulaic 
and syntactic analysis on the corpus, aimed also (but not only) at 
discovering styles and writing strategies specific of the medical texts, both 
literary and documentary: think only of the possibility to investigate 
formulaic uses and writing skills,so to find out influences or interpolations 
between authors, or the presence of literary echoes in technical or 
documentary texts.s1 The entire technical textual strategy deployed by 
medical authorss2 could be studied in this way. Analysing in depth and 


comprehending the syntactic structure of texts would allow also to solve 
problems of interpretation and attribution,s3 or even only to understand the 
exact meaning of a text (let us consider for instance the case of schematic 
prescriptions — e.g. P.Oxy. VIII 1088, http://litpap.info/dclp/63118 — where 
implicit verbs and asyndetic syntax would have to be made explicit). 

In the field of classical philology such linguistic analyses are now at a 
very advanced level, but papyrology too has made important progress, with 
the project Sematia, aimed at facilitating the linguistic tagging of the 
documentary papyri encoded in Papyri.info and described by Marja Vierros 
in this volume. Another possible way to linguistic annotation of the papyri 
is explored by Giuseppe Celano in the present book as well. The literary 
side has been unfolded by the Grammatically Annotated Philodemus 
project, conducted by Daniel Riafo Rufilanchas and Holger Essler 
(Wiirzburg) and aimed at deeply annotating the Greek philosophical papyri 
from Herculaneum on morphological, grammatical, semantic, stylistic 
layers.s4 Fragmentation is of course an issue when one decides to perform 
linguistic analysis: phrases, sentences, words are broken and it is not rarely 
difficult to understand the syntax, not to say to tokenize the words.s5 These 
are problems that digital tools must unavoidably face, and which an 
infrastructure based on multiple interconnected layers may feasibly 
overcome. 


4.6.2Lemmatization 


An annotation layer of lemmatization, that is the reduction of a declined or 
conjugated word to its original lemma, would prove essential in defining 
and analysing a specialised technical vocabulary like the one employed in 
the medical papyri, which has always been a relevant research focus of 
Isabella Andorlini’s concept of the medical corpus.s6 Such a sort of layer 
would represent an important bridge to connect the textual database to the 
related project Medicalia Online, consisting in an extensive lexical 
reference platform for ancient medical technical terms, as described by 
Isabella Bonati in this volume.s7 Systematic links to the lexical records 
(and the other way around) could contribute to create a dynamic lexiconss 
of medical technical terms in the Greek papyri. In addition, as Joanne Stolk 
observes in this volume, the possible deployment of a lemmatization layer 
would help encoding linguistic variation more properly, while encoding 
lexical information would be helpful for the creation of word reference 
indices. 


4.6.3Abbreviations 


Abbreviations are another striking point. Medical writings (prescriptions 
above all, but not only) make a particularly extensive use of abbreviated 
words,s9 developing a proper “graphical-expressive jargon”;90 given their 
technical nature, it would be extremely useful to investigate their use, e.g. 
whether there is any underlying pattern. As to now, abbreviations are to be 
encoded in the same way as the documentary papyri, that is according to 
the type of expansion — resolved or unresolved, distinguished on the ground 
of the XML syntax. Resolved abbreviations are encoded as “expansions”, 
with the <expan> tag enclosing the text spelled out and the <ex> tag 
enclosing the text abbreviated (Leiden+: double set of brackets, one 
enclosing the whole word and the other one enclosing the expanded 
abbreviation); unresolved abbreviations are encoded as “abbreviations”, 
enclosed by the <abbr> tag (Leiden+: (|x|)). Any attempt to encode the type 
of abbreviation (e.g. by raised letter or by overline) is currently 
deprecated.91 I strongly hope that in the future this level of annotation may 
be taken into consideration, since abbreviating strategies are relevant for 
the correct transcription and interpretation of texts, as in P.Strasb. inv. 1187 
(http://litpap.info/dclp/59968), which 


exhibits two cases of allegedly abbreviated words that have been object of 
interpretative discussion. At Il. 11 and 14 two v overlined with a horizontal 
stroke (belonging to a plural genitive and a nominative respectively: -@ ) are 
clearly legible; these strokes are abbreviation marks according to FAUSTI 1989, 
158, contra MARGANNE 1998, 68, following ed.pr. for the latter, which supplies 
the v as omitted by the scribe, in angle brackets. The presence of the overline 
strongly suggests that we are indeed dealing with abbreviated words: therefore, 
though relying by rule on the more recent edition, [for the digital edition] it has 
been chosen to follow the editio altera, marking the abbreviations according to 
the current Leiden+ conventions, though preserving the reading of the editio 
tertia in an |ed| tag.92 


In the described case, a correct understanding of the abbreviation mark 
proves essential in the text editing and encoding. Moreover, special ways of 
expressing combinations of characters or even entire words cannot be 
encoded but in the standard, simplified way: for example, to limit ourselves 
to the cases of P.Ant. HI 127 (http://litpap.info/dclp/65340) mentioned by 
CORAZZA 2018b, the sinusoid for a1 and the peculiar sign -// for e101, 
which must be encoded as whichever symbol <expan><ex>sto1</ex></ 
expan> (Leident+: ((€t01))), losing interesting pieces of information. 

A tentative proposal, based on a preliminary survey conducted on the 


different abbreviation typologies in the Greek medical papyri and on the 

TEI/EpiDoc XML guidelines, envisions the following possible instances 

(apart from ‘traditional’, ‘simple’ abbreviations):93 

—Supralinear abbreviations (e.g. yoAKav%(ov): PSI X 1180a i11,12). The 
superscripted letter may be tagged as any normal superscripted letter 
(<hi rend="superscript"> tag); such a combination is already possible 
and correct in the current Leiden+ syntax, yet deprecated by the official 
guidelines. XML: <expan>yoAKkdév<hi — rend="superscript">@</ 
hi><ex>ov</ex> </expan>; L+: (yadkav|*0"|(ov)). 

—Abbreviations by stroke (horizontal: e.g. t@—(v), P.Mich. XVII 758 H 
verso,2; vertical: e.g. E\(npdov), P-Mich. XVII 758 H verso,3; slanting: 
e.g. yOABGv/(y), P.Mich. 758 H,11; sinusoid: e.g. yiyvet|(a1), P.Ant. II 
127, i b, 6). The strokes may be encoded through the EpiDoc <am> tag 
(“abbreviation mark’’)o4 and further defined as non-alphabetic glyphs 
(see below) as follows: <expan> <abbr> 1@ <am> <g type="horizontal- 
stroke"/> </am> </abbr> <ex>v </ex></expan> = (1 *horizontal- 
stroke*(v)) ; <expan><abbr> ¢€<am><g_ type="vertical-stroke"/></ 
am></abbr><ex>npov</ex> </expan> = (&€ *vertical-stroke* (npdv)) ; 
<expan> <abbr> yoABav <am> <g type="slanting-stroke"/> </am> </ 
abbr> <ex>n </ex></expan> = (yoABdav*slanting-stroke*(n)) ; <expan> 
<abbr>yiyvet<am><g_ type="sinusoid"/></am></abbr><ex>a1</ex> </ 
expan> = (yiyvet *sinusoid*(a1)). Note that such combinations are 
correct in the current Leiden+ syntax, but the strokes need to be 
rendered properly in the HTML output; moreover, the <am> tag is not 
supported by the platform. 

—Discontinuous abbreviations (e.g. u(e)t(a): MPER n.s. XIII 9, 1). This 
type of abbreviation is already normally working in the SoSOL 
environment. <expan>y<ex>s</ex>t<ex>d</ex></expan> = (u(€)t(G)). 

—Abbreviations by monogram (e.g. oyu(otod); mp(6c); yp(@)).95 This type 
exploits the way in which monograms are marked up in EpiDoc and 
Leident,96 i.e. a ‘g-type’ with indication of the letters that are 
interwoven to form the monogram. <expan> <abbr> <am> <g 
type="monogram">oyi</g> </am> </abbr> <ex>oyiotod</ex></ 
expan> = ((*monogram,oyi*oyiotob)) ; <expan> <abbr> <am> <g 
type="monogram"> mp </g> </am> </abbr> <ex> mpdc </ex> </expan> 
= ((*monogram,mp*mpoc)) ; <expan> <abbr> <am> <g 
type="monogram"> yp </g> </am> </abbr> <ex> yp@ </ex> </expan> 
= ((*monogram,yp*yp@)). This may apply to some symbols for units of 
measure too, e.g. Ai(tpa), o(d)y(yia), etc., which may make easier a 
systematic study of quantities and dosages in the ingredient use. 


Number digits and values can be easily encoded in the current way 
(XML: <num value="16">1¢</num>; Leident: <# 1c=16#>). 


4.6.4Linguistic variation 


The topic of linguistic variation is the most intriguing and difficult to 
handle. As noted above, traditional critical editions tend to overcome any 
fluctuation in favour of a reconstructed text, while fluctuations are actually 
fundamental for the phenomenology of the written text. Linguistic variation 
in the papyri has already been extensively investigated by Joanne Stolk, 
who resumes her thoughts from the digital perspective in this same volume. 
I would like just to focus on some relevant points, to introduce the problem 
of linguistic variation in the medical papyri. The current markup of what I 
call textual fluctuations — handled by the <choice> tag, “indicating that [the 
readings] are two editorial versions of the same span of text, and should be 
read as alternatives, not shown side by side’’s7 — distinguishes between 
“corrections” of outright, well recognizable scribal mistakes (<corr> tag 
marking the correction, <sic> tag marking the original reading; Leiden+: 
<:correction|corr|original:>) and “regularizations” of phonetic misspellings 
(<reg> tag marking the regularization, <orig> tag marking the original 
reading; Leiden+: <:regularization|reg|original:>).9s Though the treatment 
of ‘regularizations’ has been improved during the history of the 
papyrological databases, by moving the display of the original spelling 
from the apparatus to the main text and vice versa,99 showing a stronger 
care for the phenomenology of the papyrus text, and though the meaning of 
the <choice> tag points to alternative encodings of the same text portion, 
the fact is that we are still dealing with a differentiation between a form that 
is considered as ‘standard’ or ‘regular’ and a form that deviates from it. 
Phonetic fluctuations like opbpvy / Cubpvn in the medical papyritoo (but 
see also some relevant cases in the documentary papyri, like ybtpa / 
«Kb9paio1 and &(ioontpov / d6ovmTpovi02) show that not always is it easy to 
define what is the ‘conventional’ spelling and what is the ‘deviation’, so 
that a layer capable to align the ‘variants’ to each other, word by word, 
rather than categorizing them in a sort of hierarchy, may be much 
welcome. 103 

Diachronic and synchronic fluctuations — depending on the evolutions 
and transformations of Hellenistic Greek language and on the rise of 
personal or geographical substandards104 — do occur in the medical papyri, 
but their existence not rarely points 
to deeper levels of textuality, with reference to ancient literacy and 


intertextual relations (see below), and therefore deserve a very peculiar 
attention. For example, in P.Aberd. 124, i= GMP I 1, 1 (II cent. AD, http:// 
litpap.info/dclp/63334), a papyrus fragment preserving chapter 37 of 
Hippocrates’ treatise De fracturis, at 1. 14, where all the codices (and the 
editions) have the ‘regular’ Ionic form mnyeoc, the papyrus shows clearly 
m|nyews, the Koine form, which looks like an ‘interference’ of a typical 
‘linguistic variation’ pertaining to the language of the documentary papyri, 
where it would be the standard form.10s Perhaps even more significant are 
the following cases. P.Oslo inv. 1576, a fragment of a catechism dealing 
with tumour-like diseases, 106 partly overlaps with the text of POxy. LXXX 
5239 (both II-III cent. AD). The latter is more likely a ‘treatise’ than a 
questionnaire, as its editor David Leith notes (see below for this 
distinction), and the difference may be perceived from the lack of eistheseis 
in its questions. The scarceness of the surviving portions of text makes it 
hard to say whether the questionnaire derives from the treatise or they are 
two different outcomes of a same ascendant (see below for intertextual 
relations). As far as the extant parallel text is concerned, the wordings 
diverge from each other only for one variant: bDdpoKnAn (P.Oslo, |. 5) vs 
[v]ypoKnAn (P.Oxy., |. 15). The latter is usually considered as a minority 
variant (LSJ, quoting Poll. IV 203) of the former, used e.g. by Ps.Gal. 
Def:med. 424 = XIX 447,12-13 K., but it is in fact attested three times 
among the medical writers.107 Are we facing a trivialization in the Oslo 
papyrus, or a simple phonetic variant in the Oxyrhynchus papyrus, or just 
two different traditions bearing the same degree of ‘correctness’, attesting 
to a fluid notion of technical language? Moreover, in the following line of 
the Oslo papyrus (not paralleled by its Oxyrhynchus counterpart any more) 
we read épvtp[oet6t, which looks like a phonetic variant of éAvtpoetdijc 
“lid-like”, “cover-like” (attribute of one of the membranes enveloping the 
scrotum). Rho for lambda is indeed a very frequent phonetic exchange in 
the language of the Greek papyri,10s but the same variation is to be found 
among the manuscripts preserving Ps.Galen’s Jntroductio seu Medicus, 
containing a descriptive passage (XIV 719,5—10 K.) of the same anatomical 
part.ioo9 A similar case is offered by P.Coll.Youtie I 4v_ (http:// 
www.litpap.info/dcelp/64118), a collection of prescriptions dated around the 
Il cent. AD, in which @aodc at |. 8 (“reed bed’’) could be a spelling variant 
of @Aotds (“bark”) just as @mAowdv in Dsc. HI 147 has a variant reading 
@Aobvv.110 In a completely different type of text, POxy. XIX 2221r + P.K6ln 
V 206r (http://litpap.info/dclp/61916), a I-century AD commentary to 
Nicander’s Theriaka attesting interesting textual variants (see below), at 1. 
11,29 we read Bopeitat vs Boteitar codd. (Nic. Ther. 394), which looks like 


the genuine form; the reading of the papyrus is a phonetic variant of the 
@opeitat to be found in the ancient scholia to that passage.111 Once more 
time, the impression is that we are facing a peculiar intersection of multiple 
literacies, emerging at the phonetic level but implying deeper meanings that 
cannot be flattened in a traditional apparatus. 

Even seemingly outright syntactic ‘mistakes’, in such a technical 
corpus as the medical papyri, can conceal deeper levels of meaning: an 
established prescriptive formula like témp yp@ “use with water” goes far 
beyond the apparent anacoluthon (év téat would be expected), becoming a 
distinctive mark of medical recipes, and must be treated accordingly. 112 


4.6.5Transtextuality 


As we saw, quite often linguistic phenomena may be clues to broader 
cultural facts. The complexity of textual phenomena in the Greek medical 
papyri (but not only!) can be effectively described through the concept of 
transtextuality as investigated by Gérard Genette since the Eighties. 
Transtextuality defines all the various possible relationships among texts 
(“all that sets the text in relationship, whether obvious or concealed, with 
other texts”)113 and encompasses several subcategories,114 on which I will 
base the description of the next layers. 


4.7Paratext 


Paratextuality is defined as the relation between one text and what 
surrounds the main body of the text: in Genette’s theory, paratext is mainly 
composed of titles and headings, but we may add any other graphical 
device that comes along the text itself, including punctuation, which is not 
a common feature in papyrus texts and therefore deserves special 
treatment. 115 

In a writing system based on scriptio continua (i.e. not separating 
letters into words), punctuation is a way of facilitating the reading by 
separating words or groups of words. Single, double, triple dots occur 
irregularly with this function; in TEI/EpiDoc they are encoded as “non- 
alphabetical glyphs” with the tag <g> and the attribute type defining their 
nature (e.g. <g type="middot"/>, <g type="dipunct"/>, <g type="tripunct"/ 
>). In Leiden+, the so-called ‘g-types’ are encoded by typing the attribute 
name between two asterisks. This is usually how other lectional signs 


(apostrophe, diastole, stigmai) and all graphical devices (check marks, 
deletion marks, parentheses, line fillers, strokes) work in this markup: a full 
list of what is supported by Papyri.info can be found at 
http://147.142.225.252/paptrac/wiki/gtypes, while new signs are being 
developed specifically for the DCLP. 116 

A small set of other signs, separating not letters or words but entire text 
sections, is encoded as ‘milestones’:117 this is the case with paragraphos 
(<milestone rend="paragraphos" unit="undefined"/>), which has been 
supported by Papyri.info since the beginnings, and with the new additions 
developed for DCLP, ic. koronis (<milestone rend="coronis" 
unit="undefined"/>) and forked paragraphos a.k.a. diple obelismene 
(<milestone rend="diple-obelismene" unit="undefined"/>).118 They are all 
placed between any two lines of text and in Leiden+ are encoded as rows of 
four typographical characters depending on the sign (four hyphens for 
paragraphos: ----, four equals for koronis: ====; four combinations of 
dash + angle bracket for diple obelismene: ->->->->). 

The diple has been categorized as ‘milestone’ as well (<milestone 
rend="diple" unit="undefined"/>, Leiden+: >>>>), though this may create 
some issues, since diplai are frequently used in the margins to highlight a 
specific section, phrase or word, and encoding them as milestones would 
not be semantically correct.119 A similar issue may arise when the 
paragraphos is used between two lines to separate two sections of a text 
but the logical division occurs within the preceding or the following line, as 
in PSI VI 718 = SB XXVI 16458 (IV AD, http://litpap.info/dclp/64564). 
This sheet, likely cut off from a small parchment notebook, contains part of 
a collection of prescriptions separated from each other by inline filling 
marks and interlinear paragraphoi. The last recipe starts in |. 12, following 
the end of the preceding one and after a separator mark, though the 
paragraphos is traced between Il. 12 and 13. 


<Ib n="12"/>A0A@v <g_ type="check"/></lem><rdg><choice><corr><expan> 
ME<ex>1</ex>E<ex>pt</ex></expan>itéa<supplied reason="omitted">>  </ 
supplied> Ao1Wc</corr><sic>nEElTEM@Pwvoc</sic></choice> u<supplied 
reason="lost">a</supplied><Ib n="12" break="no"/>Aayav <expan><ex> 


dpaypdic</ex></expan> <num value="6">c</num>.</rdg></app> oampov o 
<supplied reason="lost">i</supplied> 


<milestone rend="paragraphos" unit="undefined"/> 


<Ib n="13" break="no"/>vov <choice><reg>aoujoat</reg><orig>aoios </ 


—, 


orig></choice> kaAov <gap reason="lost" extent= 


nn | | 


unknown" unit= "character'"/ 


1 [-ca.?-] Traces ca. 3 characters[ -ca.?- ] 


[ -ca.?~ JvuTtoerrak[ ~ca.?- ] 1 [-ca.?- ] Traces ca. 3 characters[ -ca.?- ] 
[ -ca.?- } - ca.9 - [ -ca.?- ] [ -ca.?- ]vutroetrak[ -ca.?- ] 
ti gow xforrm;] [ -ca.?- ] - ca.9 - [ -ca.?-] 
5 [i Tw) owpéTwv Tour. ti gotw Klormn;] 
tiéorw &roSopé; 5 [fh tv] owy&twv Toph. 
[f, SuJ& Tovey Kai Opévwol(v) ti got &trod0p&; 
[owpl&twv Sia0Taotc. [f StJ& tévewv kat byéven(v) 
ti gore StaKkévrn- [owp]atwv SiKoTaotc. 
10 oc; ti oti StakévTn- 
[ore}vit(*) bia BedAdving Tav 10 1G; 
(o]wyétwy Tour. [ove]vA(*) 51& BeAdvng Tawv 
ti ote Stapapn;(*) [o]wpatwv Tour. 


Ti oti Siapagr;(*) 

[S}taxévtnoic Suk Bedd- 
15 [v]ng kal P&ppatoc A pi- 
[tou Stapepopévon KaT& 
[Tohru ErretBourdc.(*) _ 

Trdcalt] Stacpopai orrAnviw(v); 
[1]& t&v oTrAnviwv S.agpé- 
[plouot B: A+) Trax\p&/ Thy BM)Any, 
[Fi kail(*) [Tapa Td oxfpa. 

[ -ca.?- ](*) thy BAnv; 


[S)caKxévtnore Sie BeAS- 

15 [v]ng kai Pappatog F pi- 
[T]ov Stapepopévou kata 
[Todd ErreiBoudae.(*) 
Tréca(t) Suapopai orrAnviw(v); 
[T]& TV oTTANviwy Biapé- 

20 [plouor B- Hi(*) Tra\pa/ Thy G@)Anv, a 
[F Koxi}(*) [Tap& TO oXfpa. 

[ -ca.?- }(*) Tv GAnv; 


(Fig. 8a: visualizzazione del testo nel PE) (Fig. 8b: visualizzazione del testo in DCLP) 
Fig. 6: A nice comparison between the earlier SoSOL preview display and 
the current DCLP rendering of eisthesis in P.Gen. inv. 111, catechism, http:// 
litpap.info/dclp/63819 (BERTONAZZI 2018a, 42). 


In such cases, the sign is reproduced as in the original text, but the 
semantics is odd; moreover, word breaks between two lines separated by a 
paragraphos seem not to be handled by the searching engine. It is quite 
clear that rigid textual units lose relevance when one deals especially with 
technical texts, and a separate layer to record the paratext in all its 
multifarious relations with the text may prove useful. 120 

Blank spaces are a particular category of paratextual devices that 
deserves a thorough reflection. If the main purpose of punctuation is to 
divide text portions, then it is possible to think that any “space deliberately 
left blank [inside a text] is also to be considered as a mode of 
punctuation”.121 The traditional way of referring to deliberate blank spaces 
is the vacat, which is rendered with a <space> tag in TEI/EpiDoc (with the 
very same attributes as the <gap> mentioned above). Of course, encoding 
recurring blank spaces like those deployed in P.Col. IV 122 (official letter, 
181 BC, http://papyri.info/ddbdp/p.col;4;122) to separate almost every 
word from one another would be impossible, if not in a different layer than 
text. Normally, the vacat is to be marked up when it introduces a significant 
break in the text.122 Peculiar cases as in P.Oslo III 72,9 (medical treatise 
about epilepsy, IT AD, http://litpap.info/dclp/63583), where (according to 


the editors’ interpretation) the ancient scribe left a blank gap to pinpoint a 
controversial point, should be further (or differently) annotated in order to 
preserve the original intentions. 123 

The handling of blanks is connected to the problem of how to encode 
ekthesis and eisthesis, i.e. extension and indention of a line with the 
purpose of highlight particular phrases, which has never been taken into 
consideration before for documentary papyri. Among the medical texts, this 
device is frequently deployed in the questionnaires or catechisms. Such a 
text typology provides medical notions in a dialogue format, where a 
question about theoretical definitions or practical procedures is followed by 
amore or less detailed answer. 124 Its use as a handbook, a reference tool for 
the doctors’ preparation, is clear also from the complex set of devices 
employed to highlight the articulation of the text: questions are very often 
indented in eisthesis, and further marked with paragraphoi, line fillers or 
other lectional marks that introduce the answers as well. This mise en page 
reflects the central role played by the question-and-answer structure of the 
didactical tool,i2s and must be preserved when the texts are moved to any 
modern format. This is not only a matter of reproduction. In the overall 
framework of difficulty of recognizing textual genresi26 due to the 
fragmentary state of the scattered sources preserved to us, scholarship relies 
on any possible feature for a better understanding of ancient texts, and 
some very fragmentary texts have been identified as questionnaires just on 
the basis of the presence of blank spaces (P.Oxford Sackler s.n., I century 
BC;127 more recently GMP I 6 and P.Strasb. inv. 849):128 it is therefore 
unconceivable to encode such texts without paying attention to their 
paratextual garment. 

It is tempting, at a first stage, to equate an eisthesis to an initial vacat 
and therefore to encode it like that. However, as we have to encode not the 
visual appearance of the text but its semantic core, we must be aware of the 
fact that we are not describing a certain extent of space intentionally left 
without characters, but a displacement of the line beginning to stress its 
relevance.129 Its specular counterpart, ekthesis, makes the picture clearer: 
by no means can it be indicated by creating weird virtual vacats at the 
beginning of the surrounding lines. The current solution is to mark it as an 
attribute of the line: <lb n="1" rend="indent"/>, which in Leiden+ appears 
as (1, indent) — the same way marginal annotations are tagged (see below; 
for ekthesis the value "outdent" is to be used).130 This seems to work fine, 
and is now fully supported by the DCLP platform also in terms of visual 
display. 

However, a further problem arises if we consider that in some catechisms 


the questions do not start in a new line, but on the same line as the end of 
the previous answers, after a blank space that cannot be considered as a 
vacat for the same reasons as above. In this case, if we tagged the entire 
line as in eisthesis we would not represent the situation correctly, since the 
first part of the line is not really indented. A new solution might be to tag 
the question phrase with the TEI/EpiDoc XML <hi> element, which is used 
to sign “highlighted characters or words”, “with a rend attribute specifying 
the kind of highlighting”,131 In our case, the value of the rend attribute 
would be "eisthesis", i.e. <hi rend="eisthesis"></hi> (or "ekthesis" in the 
other case), which is not supported by SoSOL currently. 132 

The <hi> tag handles also the markup of ancient diacritical signs 
originally added by the scribe,133 which has always been well developed 
since the earlier times of Papyri.info. The canonical cases are accents 
(acute, circumflex), spirits (lenis, asper), diaeresis, either alone or in 
combination, marked with a rend attribute, the value of which corresponds 
to the name of the sign. Leiden+ markup is rather intricate (they must be 
added in the proper Unicode character inside a pair of brackets just after the 
appropriate letter, which in turn must be always preceded by an extra space, 
in whichever position it occurs in the word), but fortunately the editorial 
platform offers quite a helpful menu to automatically perform the task. It is 
worth noting that the presence of ancient diacriticals is noted in the 
apparatus. 

The occurrence of images within the text is currently handled as well. 
The <figure> tag is used, and a free description of the picture can be 
inserted in a nested < figDesc> tag; Leiden+ simply indicates it with the 
free description preceded by a hash mark. In medical papyri this proves 
quite useful when dealing with the cases of illustrated herbals, where the 
extant images can be easily encoded with #plant (© 
<figure><figDesc>plant</figDesc></figure>). 134 


4.8Intertextuality & hypotextuality 


Intertextuality is defined as the relation between parallel text, in the form 
e.g. of quotation or allusion;i35 hypotextuality (with its opposite, 
hypertextuality) as the relation between a text and a preceding one that is 
transformed, modified, elaborated or extended. Due to the high degree of 
both theoretical and practical re-elaboration of medicine — 
“reperformance”, in a sense, to borrow a term created to describe the 
interplay between text transmission and representation in classical drama136 


—, medical papyri show a complex degree of both inter- and hypotextuality. 
Not only are the ‘classical’ medical treatises and handbooks copied 
following the original text, but they are also quoted, or referred, or re- 
elaborated in other writingsi37 (anonymous treatises as well as manuals, 
catechisms or collections of prescriptions, and of course commentaries) and 
excerpted by the late compendiasts (Oribasius, Aetius, Paul of Aegina), 
who took and interwove excerpts from the earlier authors in order to create 
composite texts, with the purpose of assembling the best from previous 
writings. 138 

The interconnection between all such parallel or derived texts is of the 
utmost importance for evaluating the history of medical science, the 
dynamics of ancient textual transmission, and the framework of literacy 
among medical experts, so that an annotation layer that may link the actual 
text on the papyrus to any relevant related passage in other sources would 
be most useful.139 In the cases of papyri preserving ‘literary’ works 
(Hippocrates, Galen, etc.), for example, our fragments quite often do 
provide more genuine readings than manuscript tradition, since they are 
chronologically closer to the source;140 they can therefore support some 
manuscript versions against others, or even preserve previously unattested 
variants, facts that deserve a particular attention. 

A small selection of significant samples will suffice. In POxy. XIX 
22211 + P.K6ln V 206r (http://litpap.info/dclp/61916), the abovementioned 
I-century AD commentary to Nicander’s Theriaka, the extant quoted 
passages generally agree with the more recent manuscripts of Nicander’s 
tradition (= @) against the ancient codex Parisinus II, and show also new 
genuine variants.141 The comments, in turn, do not show many points in 
common with the known scholiastic tradition, and may be traced back to 
the most ancient comment to Nicander’s Theriaka, that by Demetrius 
Chlorus.142 In the Aberdeen Hippocratic papyrus (GMP I 1), already cited 
with regards to the adaptations to the Greek language spoken in Egypt, we 
do find variants already attested in the manuscript tradition (Il. 4-5) but 
also passages completely divergent from the codices (Il. 11-12, where the 
length of the gap and the shape of the following traces exclude the 
unanimous manuscript tradition, which is of course printed in all the 
editions, in favour of a previously unattested variant). 143 

Alignment among parallel versions of the same text, by linking external 
resources providing canonical literature,144 can therefore convey precious 
information and significant analysis tools, and can be well extended to all 
the cases (even documentary ones) of texts preserved in more than one item 
(copies, duplicates).145 It would also solve the problem of encoding 


philological variants and manuscript readings in the papyrological digital 
editions, a challenge faced during the construction of the DCLP database 
and not yet satisfactorily solved.i46 Alignment of ancient or modern 
translations of the medical texts (e.g. Latin or Arabic versions) should also 
be taken into consideration, 147 while a fruitful integration between syntactic 
annotation/analysis and intertextual referrals may be envisioned for the 
most intriguing issues of medical literature, as recently argued by 
Francesca Bertonazzi: 


Panalisi del lessico tecnico dei papiri chirurgici ha portato a individuare 
paralleli testuali tra testo tramandato su papiro e tradizione manoscritta, talvolta 
significativamente stringenti come nel caso di P.Strasb. inv. 1187 e diversi passi 
di Eliodoro ap. Oribasio. Alcuni altri papiri (P.Lond.Lit. 166, P.Gen. inv. 111, 
P.Fuad.Univ. 1, P.Ryl. 3.529), come gia notato dagli studiosi, sono caratterizzati 
da una forte presenza di ‘lessico eliodoreo’ e da alcune peculiarita proprie del 
modus operandi del chirurgo, come la predilezione di interventi chirurgici che 
siano il piu sicuri possibili per il paziente, nonché del modus scribendi, come il 
ricorso frequente alla prima persona — singolare o plurale —, la definizione con 
esattezza delle posizioni ‘topografiche’ della parte operata (dentro, fuori, sopra, 
sotto), e una sostanziale semplicita delle strutture sintattiche usate. Ad oggi, i 
tentativi di attribuire i papiri citati alla paternita di Eliodoro si sono basati quasi 
esclusivamente su criteri lessicali nel confronto tra il testo tramandato su papiro 
e sul capitoli di Oribasio che portano la titolatura ‘da Eliodoro’. Una nuova 
possibile strada offerta dalle nuove tecnologie della papirologia digitale é quella 
costituita dall’annotazione sintattica dei testi: un’analisi pi accurata non solo 
del lessico, che come é noto é la parte piu ‘volatile’ della lingua, ma delle 
strutture morfologiche e sintattiche dei passi del compilatore tardo in sinossi 
con i testi dei papiri, sia pure nella limitatezza delle pericopi testuali preservate, 
potrebbe gettare nuova luce anche su questo aspetto tra i pil incerti quanto 
stimolanti della ricerca. 148 


Re-elaboration is probably the most striking feature of technical texts, 
stemming from oral teaching and then continuously adapting their content 
according to the developments of knowledge. Medical genres like the 
questionnaire or the collection of prescriptions illustrate this framework at 
the best, though we do find plenty of crossreferences in treatises too. 149 
Catechisms (erotapokriseis), for example, are clearly derived from and 
devoted to some sort of oral teaching, as we pointed out above while 
discussing of their paratextual devices. Yet there exists a considerable 
similarity with the literary genre of the “definitions”, connected with the 
research and teaching practice of Hellenic medicine and attested in the 
Greek Pseudo-Galenian treatise Horoi or Definitiones Medicae (XIX 346- 
462 Kiihn) and in the Latin Pseudo-Soranian Quaestiones medicinales.150 


In fact, David Leith has recently distinguished two types of question-and- 
answer medical texts: the proper catechisms, being introductory manuals 
for the student of medicine, and wider treatises on remedies. The 
suggestion came from the similarities detected between erotapokriseis on 
papyrus like P.Turner 14 (http://litpap.info/dclp/63560) and PSI inv. 3783 
(http://litpap.info/dclp/63244) and the excerpts from the physicians 
Herodotus and Antyllus preserved in Oribasius’ Collectiones Medicae.151 
One may also recall the similarities between the abovementioned P.Oslo 
and P.Oxy. overlapping questionnaires, or between the surgical catechism 
P.Gen. inv. 111 (http://litpap.info/dclp/63819) and the treatise known as 
Cirurgia Heliodori,\s2 or also between P.Aberd. 11 (http://litpap.info/ 
delp/63332) and P.Ross.Georg. I 20 (http://litpap.info/dclp/63569), two 
ophthalmological catechisms that certainly derive — with variations — from 
the same source. 153 

Prescriptions are even more complex (and fluid) in transtextual and 
hypotextual relations. I have extensively dealt with transmission of ancient 
medical recipes elsewhere, where I outlined the articulated route from oral 
compositions and draft transcriptions to professional exchange and 
collection.154 Medical prescriptions are fragmentary units, which stem from 
diagnostic-therapeutic practices and oral knowledge that are recorded on 
wax tablets (pinakes), first kept at the sanctuaries of the healing gods, then 
collected by leading physicians (namely Hippocrates) in order to build 
systematic medical repertories.155 At this stage it is hard to trace any actual 
intertextual relation, but when — seemingly in the early Roman age — 
prescriptions start circulating among the physicians, the plot gets intricate. 
Professional doctors exchange single recipes on papyrus scraps with each 
other and collect those fragments of medical knowledge into lists and 
catalogues on parchment booklets, deploying a set of paratextual devices to 
preserve the unity of each prescriptive text. Galen is the best witness to this 
‘research’ activity,is6 as well as of the ‘philological’ attention to the 
pharmacological books held by the libraries, which he himself consulted 
and collated to get the most exact versions of the texts and to compile his 
famous treatises on the composition of remedies. 157 

This workflow is by no means exhausted with Galen: among the 
numerous possible examples, P.Berl.Moller 13  (http://litpap.info/ 
dclp/64268) is a stunning instance. This papyrus, a comparatively large 
portion of a roll from Hermoupolis Magna, dated between the late HI and 
the early IV century AD, is written on the recto along the fibres, therefore 
purposely produced as a collection of medical prescriptions, of which only 
two columns survive. The first one contains a single prescription “to 


prevent hair loss on the head”, identified by MARGANNE 1980 as a 
prescription ascribed by Galen to Heras of Cappadocia, a pharmacologist 
active between 20 BC and AD 20. The text on the papyrus parallels Gal. 
Comp.med. loc. XII 430,8-15 K. verbatim,15s while other variant versions 
of the same remedy are recorded by Galen himself (ibid. XI 435-6 K.) as 
antecedents of Heras’ one.159 Subsequently, CORAZZA 2016 discovered that 
also some remnants of the second column can be identified with other 
recipes by Heras, this time against headache, mentioned by Galen as well, 
with some wording variants.160 Two of them patently parallel Galen, but 
the papyrus is by no means a copy of On the composition of medicaments 
by places: the recipes do not follow the canonical order in which they are 
cited in Galen’s treatise, clearly attesting a work of selection, extraction, 
and thematic re-arrangement, in which each recipe is treated as a unit to be 
managed on its own; moreover, the other two identified prescriptions look 
like variants of Heras’ texts as reported by Galen, thus attesting a ‘fluid’ 
stage of transmission, in which recipes are modified and adapted according 
to the users (Galen himself, as we saw, attests some earlier versions of 
Heras’ recipe against hair loss). It is apparent that this interconnection of 
“living texts”,161 copied and re-copied from original pieces or different 
collections, generates cross-references and inter-quotations that may well 
fall into the cases described in these paragraphs. 162 


P.Ant. 3.126, Fr. A 


13,minf. [a]Gupou € .1 [.4JAwv tou Enpiou [.15] [e]rixee Beppov 

14,minf. [vJbwp kat Aetlorp.Be] KaAWS peta .1 [.15] [e]Aatou KaTaTrAaooE 
15,minf. [.4Japrou .1 [.7].2 ovome .3 [.?] [WA]acowv To kata 

16,minf.- [TAaopa) [.7] 


Fig. 7: Sample markup of marginalia according to the current standards 
(from BERTONAZZI 2018a, 73). 


[tot]e the énvddcewe xpdvorg mel- 
TOV. -piv. 
[ovlg altlat tfc mpoopoptic clave ef 


Fig. 
8: P.Tebt. 272,4—5 (courtesy of the Center for the Tebtunis Papyri, University 
of California, Berkeley). 


4.9Metatextuality 


Metatextuality is the explicit or implicit critical commentary of one text on 
another text. For the same reasons described above apropos of paratext and 
intertextuality, namely the fluidity of medical technical texts, always 
subject to renovation and update after practical and individual experience, 
the practice of annotating is widespread in the medical papyri.i63 The 
annotations can acquire either the organic format of the commentaries, 
autonomous exegetic treatises, the most illustrious examples of which are 
Galen’s commentaries on Hippocratic texts164 (see e.g. Gal. In Hp. Epid. I 
4 = CMG V 10,2,1, p. 78,7-11 Wenkeback, where the author himself 
explains some reasons for compiling commentaries), or the scattered aspect 
of the scholia or marginal annotations: see the examples of P.Ant. I 28 
(http://litpap.info/dclp/60189), fragment of a I[I-century parchment codex 
from Antinoupolis with the text of Hippocrates’ Aphorisms and 
marginalia,i6s as well as of PAnt. I 186 (http:/Avww.litpap.info/ 
dclp/59961), a very fragmentary large-format papyrus codex from the same 
place, dated to the VI cent. AD, which contains sections of Galen’s De 
compositione medicamentorum per genera along with some scanty 
marginal annotations. 166 

In both cases, textual relations are complex.167 Commentaries refer to 
other texts but without exact parallels, except for literal quotations (see 
above); marginal notes refer to the main text without being part of it, so 
that the current treatment in digital editions may be slightly misleading, 
since it allows for marking the marginality of the passage (added to or 
written into the margins), but not the type of relation with the main body of 
the text.16s The XML syntax is clear: marginalia are encoded as plain text 
lines, with the indication of the margin attached to the line number. 169 Let 
us consider, instead, a more complex case, represented — again from 
Antinoupolis — by P.Ant. II 126 (http://litpap.info/dclp/65233): 


P.Ant. 3.126 (VI-VII secolo d.C.) é parte di un compendio sul trattamento 
farmacologico e chirurgico della tonsillite e rappresenta un esempio di 
‘enciclopedia medica’ redatta in epoca bizantina; il ritrovamento di testi come 
questo conferma I’idea che nella pratica medica antica la trasmissione del sapere 
avvenisse tramite la combinazione di fonti tradizionali, tramandate per 
tradizione scritta, e di materiale desunto dalla pratica medica quotidiana e 
registrato proprio dagli specialisti che operavano sul campo. 

Il testo principale, ovvero quello scritto in carattere pil grande nella parte pit 
estesa di papiro, é arricchito da annotazioni nel margine inferiore che 
riguardano alcune terapie farmacologiche da impiegare nel caso dell’insorgere 


delle patologie descritte nel testo, e tale modalita di uso e riuso del testo 
testimonia |’iter con cui il sapere tradizionale era compendiato, arricchito e 
integrato nei libri tecnici dai possessori dei testi. Le caratteristiche di layout, la 
consistenza dei margini (quello inferiore, quasi totalmente conservato, misura 5 
cm) e la scrittura regolare, oltre all’indicazione in alcuni punti degli spiriti, 
lasciano pensare che il frammento fosse parte di un codice di notevoli 
dimensioni e, dunque, di un certo pregio; il tipo di annotazioni riportate nel 
margine, anche in mancanza di notizie piu specifiche circa l’uso di questo 
codice, fanno pensare che il redattore potesse essere un medico piuttosto 
competente o un soggetto forse ancora in formazione ma abituato alla pratica 
medica.170 


The relation between the marginalia and the text is tight, though the current 
markup can be arranged just as follows: 

It is clear that we are not dealing with simple additions to the text, which 
are easily encoded with the <add> tag, further specified — with the place 
attribute — according to the position of the insertion (above, below, left, 
right, interlinear).171 Scribal additions can be effectively utilized under 
particular circumstances, as in the case of P.Oxy. IX 1184v (http:// 
litpap.info/dclp/60175), a I-century AD fragment exhibiting part of a 
collection of Hippocratic epistles likely arranged by theme (the extant texts 
deal with Hippocrates’ invitation to Persia by the Great King, which he 
self-confidently refuses).172 The papyrus contains different versions of the 
Pseudo-Hippocratic letters 3, 4, 4a, 5, and 6a (ed. Smith), separated by 
initial ektheseis, and paragraphoi between each other.173 Ep. 3 is shortened 
at the end, and its ‘canonical’ conclusion has been appended as a 
supralinear insertion flowing into the right-hand margin — this is easily 
encodable with a combination of the two relevant <add> tags. But then Ep. 
4 was transcribed twice, in an abridged version in the main text, flanked by 
a shorter form without the introductory salutation (Ep. 4a), added into the 
right-hand margin and separated from the main body of letter 4 with an 
irregular vertical line. Further below, between letters 4 and 5 (Il. 17-19), 
three lines of comment appear, unattested elsewhere. 

Marginal or interlinear additions merge with comments in a complex 
metatextual net that sometimes overflows into the text itselfi74 and show a 
remarkable ‘philological’ care for the text by the ancient scribes. The case 
of Hippocrates’ fourth letter, described just above, is rather meaningful: the 
marginal text is not a comment (like the following interlinear insertion) nor 
an addition (like the preceding supplementary insertion), it is an alternative 
parallel version, a proper variant of the text presented in the main body of 
the papyrus. In this case, the vertical, irregular line traced by the ancient 


scribe to divide the two alternatives acts as a proper indication of a textual 
variant.175 We do find even more puzzling instances. P.Tebt. II 272v (http:// 
litpap.info/dclp/60048, late II cent. AD) is a fragment of Herodotus 
Medicus’ De Remediis, describing the symptomatology of thirst and its 
treatment; the text corresponds in part to an excerpt of Herodotus Medicus 
preserved with Oribasius’ treatment of thirst in case of fever (Coll.Med. V 
30, 6-7 Raeder = CMG VI 1,1). At a certain point, where the text reads 
aitiat Ths Tpoo@opiic (1. 5), introducing the different reasons for giving the 
sick something to drink, the scribe added two groups of three letters 
between dots above the line:176 *t@v* above tijc, and *p@v* above pic. 
This is not an addition supra or infra lineam, since it is clearly an 
alternative to the syntagm below (plural instead of singular); and since 
nothing appears deleted, it is not clear if the ancient writer wanted to 
correct the text or just juxtapose two different versions of the same 
passage.177 We cannot be sure of what is going on here because this variant 
is unattested in the manuscript tradition, i.e. in Oribasius’ passages quoting 
Herodotus Medicus, which all feature the singular form. We would have a 
scribe correcting the form unanimously preserved by the manuscript 
tradition and replacing it with an unattested variant. The P.Tebt. editors 
speak of “correction or alternative reading”, M.-H. Marganne of 
“hésitation”;178 if we should define it, we ought to call it a ‘scribal variant’, 
just as in the Hippocratic case presented above, as well as in P.Oxy. LVI 
3851 (http://litpap.info/dclp/61917, II-II] AD), a fragment of Nicander’s 
Theriaka (333-4), which at |. 12 reads apeoBiotat[ov] (attested in most of 
the manuscript tradition) with a v added supra lineam between dots, being 
mpeoBvotatov an alternative version attested in some of the manuscripts (= 
Ky). 
It is not easy to deal with these cases digitally,i79 at least with the currently 
available tools, which deploy instead a full set of tags aimed at encoding 
plain scribal corrections, i.e. additions (see above), deletions (<del> tag 
with rend attribute describing the type of deletion: "erasure", "slashes", 
"cross-strokes"),180 and replacements (<subst> tag containing a nested <add 
place="inline"> tag defining the corrected text and an equally nested <del 
rend="corrected"> defining the replaced text).181 

The philological care testified by the cases of ‘scribal variants’ 
mentioned above is even more patent when the text is an autographis2 and 
is equipped with authorial revisions, for which an important contribution 
can come from the XML annotation of genetic criticism phenomena 
recently developed by Elena Pierazzo.is3 Raffaella Cribiore has recently 
showed how genetic criticism — aimed at reconstructing the process of 


authorial constitution of a text — can be successfully applied to 
papyrological texts.184 


4.10Editorial interventions (modern) 


Modern alternative readings and editorial supplements do influence 
linguistic annotation, in that they add data, which are not stricto sensu 
‘original’ to the text. Alternatives produce multiple possible readings, one 
of which is usually the most probable but without full certainty, and the 
other possibilities may well fit the context. Supplements, though most 
likely and in some cases pretty unavoidable, are nonetheless a modern 
contribution to the ancient fragmentary text and deserve a particular 
attention. They can even be incorrect, and thus fall into the third category 
of editorial corrections, which encompass all modern corrections made to 
the readings of previous modern editors. Alternatives and editorial 
corrections are currently encoded as apparatus elements (<app>) defined by 
the type attribute and composed of a “lemma” (<lem>), i.e. the word in the 
text, and a “reading” (<rdg>), i.e. the alternative in the apparatus. In Leiden 
+ they are marked with the |alt| and the |ed| tag respectively. Such a markup 
strategy works fine from the ‘philological’ viewpoint, since it provides a 
main reading — supposedly the most correct — and a set of critical 
alternatives, either proposed by the same editor or sedimented through 
years of scholarship, with the possibility to indicate the authorial 
responsibility for each reading (resp attribute in the <lem> tag).185 
Nevertheless, the impossibility to search for combination of words 
including the terms in the apparatus makes this choice rather uneasy for the 
purposes of digital databanks, while different layers of text, each one 
featuring a single textual alternative, may enhance the digital representation 
of the papyrus, especially when considering that editorial interventions 
occur more frequently in the medical papyri than in the documentary ones, 
for the peculiar attention to the editorial history that characterizes the items 
of the Digital Corpus of the Greek Medical Papyri.is86 

On the other hand, supplements are tagged as such with the <supplied> 
element, and a reason attribute that defines the type of integration: "lost" if 
the original text is lost (Leident+ square brackets), "omitted" if the original 
text was left over by the ancient scribe (Leiden+ angle brackets), "parallel" 
if the text is inserted on the ground of a parallel text (Leiden+: pipes + 
underscores |__| ). The opposite case (removal of ancient surplus text) is 
marked with a <surplus> tag (Leiden+: curly brackets). In this case, 


integration with the text layer is granted by the fact that the <supplied> tag 
indicates a text portion. This is even clearer if we compare it with the tag 
used to mark unsupplied lacunas, that is a <gap> tag with a reason attribute 
set to "lost" (see above).187 

The case of the gaps is indicative of the semantic difference between 
digital and paper edition. In a printed critical system, both supplied and 
unsupplied lacunas are marked with square brackets because the focus lies 
in the descriptive layer of the papyrological fact: a certain missing part of 
the text, which may be recoverable or not. In a digital critical context, we 
need to define whether a lacuna bears a textual meaning (1.e., a supplied 
text) or not (i.e., a gap in the text). Leiden+, following the printed 
conventions, adopts square brackets for both, in order to help the users; but 
the system automatically chooses the appropriate XML code according to 
the content of the brackets. Therefore, when the papyrus displays a partially 
supplied gap, which is enclosed by the very same pair of brackets in the 
printed edition, in the digital edition the two different parts (supplied and 
unsupplied) must be kept separated since they mean two different facts. 
Leiden+ brackets are different than Leiden printed ones also in that the 
former must be always opened and closed at each gap, while in a printed 
edition they can be left open (or unclosed) if their exact extent is unknown. 

Somehow ambiguous, in conclusion, is the treatment of modern 
corrections in the case of misspelled words. Though the typical treatment 
involves the |reg| and |corr| tags according to the type of intervention (see 
above), the use of traditional Leiden conventions (angle brackets for 
supplements of omitted letters, curly brackets for removal of surplus 
letters) is admitted in case of outright diplographies (“where the letter(s) is 
genuinely superfluous”, so say the guidelines)iss or trivial omissions (for 
which it is preferred to the |cort| tag). 


4.11lmage 


When available, the addition of a digital picture is fundamental for a 
complete evaluation of the papyrus. The advanced possibilities of virtual 
objects, of which I discussed elsewhere,1s9 could be further enhanced by 
aligning text and image, a procedure that has been successfully attempted 
by the Anagnosis project at Wurzburg. 190 


Fig. 9: PMich. XVII 758 H r/v: main text with recipes taken from other 
authors; marginal annotations and additions with a reference system of 
coronides and other graphical marks (YOUTIE 1996, PI. 8). 
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Fig. 
10: Printed edition of P.Mich. XVII 758 H r (YOUTIE 1996, 59). 
P.Mich. Inv. 21 H verso 9 x 8 cm. 
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11: Printed edition of P.Mich. XVII 758 H v (YOUTIE 1996, 61). 
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LC), [-ca?- J 
eveooals]) Bueila JC) [-ca.?-] 
auriy [_.] Sorte [-ca.?-] 
1 éuttrAcotpos Po8[ivn - ca.3 -] 
5 xabapas(*) Ta TETAA[A Hooe-] 
trou kAddou Tou 8[ -ca.?- ] 
(hand 2) [E]ptAactos(*) apodeu[o]pévea Gaca[opévep - ca.14- J 
(hand 2) (added at left: ((coronis)) ) wot) Té> T6VED 
(hand 2) 5é0x(a)(*) knpot (ovyy ) a [-ca.?-] 
1@hand 2) (added at left: xp(notSs)) ((cross)) (*) niguos EoB(eoa(?)) (Spaxu) aa [-ca.?-] 
(hand 2) xaABav(ns) (ovyy ) B nul -ca.?- 1) 
(hand 2) xaABavnv kai [-ca.?- ] 
(hand 2) [| acacap] Sul -ca.?- ](*) 


(hand 2) Bin8r\oas xp [ -ca.?- ] 


1 [-ca.?-] kai m[p]os tous 
[koATrous TpCopa] tou[s] Kai <émri(?)> Ta(v) 
[EAkddv ouvovAo]upEéveov(*) Kai 
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4 H.r.1-2. or [ouv] |évesca[s] 

4 H.r.2. or veplyas] 
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4H.r.13. or Bu[eig -ca.?- ] 

4 Hv.3. or [étrouAo]upéveov 

4 Hv.3,md. or E(npiov) 

4 Hiv.5. corr, cx xwébAas 

4#Hv-.5. 1. kal 


Fig. 12: DCLP 
digital edition of P.Mich. XVII 758 H r/v (http://www.litpap.info/dclp/59332). 
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13: Tentative ontology model for P.Mich. XVII 758 H r. Some layers are 
simplified; note that in the metatext layer (below) the hypotext and the 
hypertext are merged (and some editorial supplements and alternatives are 
missing) in order to give space to the annotation of abbreviations and 
symbols, which clearly shows their intensive deployment by the second 
hand (= the owner of the codex). Noteworthy is [€]trAaoTtoc (corrected from 
[E]UIAJ, |. 7), which is a substandard spelling variant of the more common 
EuTTAGOTpOG: as already noted by YOUTIE 1996, ad loc., “EuTTAaoTOG, 
according to Galen (XIII 372 [K.]), was an earlier form of EutTrAaoTpoc”. 
Moreover, the entire metatextual paragraph must be noted as written in 
ekthesis in the bottom margin, which is marked line by line in the Leiden+ 
code. Here, it would suffice to encode it as a marginal metatext and to 
connect it to an ekthesis paratext layer (see the following sample). 
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ekthesis Fi . 
14: Tentative ontology model for P.Mich. XVII 758 H v. Some layers are 


simplified; in the metatext layer (below) the hypotext and the hypertext are 
merged as in the preceding sample, but here symbols are not handled in 
order to give space to the ekthesis paratext layer and to the intertextual 
layer, since the recipe added to the bottom margin (Il. 6-9) closely recalls (in 
the typology and number of the ingredients and in their quantities) a 
passage of Paul of Aegina (VII 17,31). Note also how the right-hand-margin 
additions are handled as metatext layer connected to II. 3—4 of the main 
text, whereas the Leiden+ markup does not handle the situation properly 
(marginal lines can be added within the text, or at the end, but in both cases 
some metatextual information gets lost). Quite interestingly, the scribal 
phonetic correction yiyétAac for xipél6llAac (I. 5) attests to a preference for a 
form used by Paul (e.g. Ill 79,1) rather than other medical writers (e.g. Orib. 
Coll. IV 615,19; 620; Syn. VII 45; Gal. XIII 380,5 K.; 383,17 K.). 


Paul.Aeg. VII 17,31 


is intertext to 


SConcluding remarks 


The so-called Michigan Medical Codex (P.Mich. inv. 21 = P.Mich. XVII 
758, http://www.litpap.info/dclp/59332)191 resumes at the best most of the 
preceding arguments. It is a [V-century small-format papyrus codex, of 
which thirteen leaves survive to an amount of twenty-six pages, in which 
numerous recipes are collected — seemingly — according to type of 


medication (pills and lozenges, then wet and dry plasters, at least in the 
extant pieces). Commissioned by a practicing physician,192 the document 
shows various degrees of textual interventions. In the original writing, 
recipes start with an indented heading, declaring the type of remedy, and 
are separated from each other with lines and small blank spaces; they 
typically contain the list of ingredients, followed by directions for 
composition and use. Many prescriptions are ascribed to famous doctors, 
showing 


correspondences with recipes for plasters in the collections of Galen, Oribasius, 
Aétius, or Paul of Aegina that have come down in the manuscript traditions, 
highlighting the striking degree of continuity among ingredients and their 
relative proportions from hand-written copy to hand-written copy over many 
centuries: the presence of plasters from a variety of different physicians 
suggests that the basic text of the codex was combining and taking its shape 
over considerable time. 193 


Then, the interventions by the owner of the codex: 


First he collated the text of his newly-made copy against an exemplar, making 
corrections in addition to the items already corrected by the scribe, and then he 
went on to more than double the contents of the codex by filling the margins 
with additional recipes for pills to medicate bodily ills and plasters to medicate 
wounds and lesions of every kind. Because empty space was limited, he 
emphasized separation between recipes through lines and marginal markers. 194 


Intertextuality, hypotextuality and similar connections merge together, 
creating a very complex and unique clockwork: “although individual 
recipes in a collection on papyrus often resemble items in the known 
authors, each extensive collection on papyrus has thusfar proved to be a 
unique assemblage”.195 The paratextual function of critical and lectional 
marks stresses the “composite” structure of the text,196 while authorial 
corrections and phonetic variants are not absent from the textual level. 

Let us compare a part of the printed edition with the corresponding 
digital edition currently featured in the DCLP,197 followed by a tentative 
proposal of (partial) ontology network to describe the multiple textuality of 
the sample. 

Admittedly, printed or printed-like media are physically limited as to 
dealing with complex degrees of textuality, and adopted the critical edition 
model as a way of fixing a text for scholarly purposes. On the contrary, 
ancient textual criticism -— recognizable in the commentaries, the 
annotations, the philological interventions, the paratextual care deployed by 


the ancient scribes and scholars — was apparently a way to pass down 
knowledge, i.e. a means of text transmission rather than text reconstruction 
and fixation. Nowadays, thanks to the digital tools, we do have the 
occasion to develop digital infrastructures in a _hyper-dimensional 
cyberspace to overcome traditional criticism and its shortcomings, and to 
conceive a digital critical edition with deeper and deeper levels of text 
analysis (markup tagging, linguistic or semantic annotation layers, in-text 
information). 198 

As BODARD — GARCES 2009 argue, a major advantage of digital 
editions (namely the papyrological ones) is the possibility to get back to the 
materiality of texts, avoiding the philological necessity of reconstructing an 
archetype and focusing on text transmission instead. “[A]ttention would be 
better focused on how to present a text with multiple manuscript witnesses 
to a reader in a digital environment’: 199 


Digital editions may stimulate our critical engagement with such crucial textual 
debate. They may push the classic definition of the ‘edition’ by not only 
offering a presentational publication layer but also by allowing access to the 
underlying encoding of the repository or database beneath. Indeed, an editor 
need not make any authoritative decisions that supersede all alternative readings 
if all possibilities can be unambiguously reconstructed from the base manuscript 
data, although most would in practice probably want to privilege their favoured 
readings in some way. The critical edition, with sources fully incorporated, 
would potentially provide an interactive resource that assists the user in creating 
virtual research environments.200 


Thus, the authors hop[e] that digital or virtual research environments would 
support the creation of ‘ideal’ digital editions where the editor does not have to 
decide on a ‘best text’ since all editorial decisions could be linked to their base 
data (e.g., manuscript images, diplomatic transcriptions).201 


Similarly, NICHOLS 2009 states that the ideal of the archetype text and 
textual criticism is an “artefact of analogue scholarship” consequent to the 
limitations of the printed pages. Conversely, 


[t]he Internet has altered the equation by making possible the study of literary 
works in their original configurations. We can now understand that manuscripts 
designed and produced by scribes and artists — often long after the death of the 
original poet — have a life of their own. It was not that scribes were ‘incapable’ 
of copying texts word-for-word, but rather that this was not what their culture 
demanded of them. [...]. [I]t requires rethinking concepts as fundamental as 
authorship, for example. Confronted with over 150 versions of the work, no two 
quite alike, what becomes of the concept of authorial control? And how can one 


assert with certainty which of the 150 or so versions is the ‘correct’ one, or even 
whether such a concept even makes sense in a preprint culture.202 


Thus, the digitization of manuscripts and the creation of digital critical editions 
have not only provided new opportunities for textual criticism but also might 
even be viewed as enabling a type of criticism that better respects the traditions 
of the texts or objects of analysis themselves203. 


Consider also the reflections of CAYLESS 2010 about the prominence of the 
transmission of content on its external appearance: 


[p]agination is a relatively fragile construct in the digital age”, and textual 
“accretions” like commentaries, glosses and marginal notes, progressively 
gathered around the main text in its historical transmission, can be effectively 
encoded and represented in digital editions that not simply replicate print 
technologies.204 


When we note (again after CAYLESS 2010, 162) that — functionally and 
theoretically — traditional commentary is a hypertext in print,20s everything 
comes full circle, and it appears clearly how the new technologies can 
produce a very similar outcome as the ancient textual criticism described 
above. It can be argued, therefore, that a digital critical edition can develop 
into something completely different from the somehow ‘old-fashioned’ 
printed critical edition: namely, a further step in the fluid textual 
transmission of ancient sources. 
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Notes 
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8: TM 60411: Homer, /liad VII| 433-47. Image © BerlPap. Berliner 
Papyrusdatenbank. 


Anagnosis, Herculaneum, and the Digital 
Corpus of Literary Papyri 


Fig. 


This articles stems from two separate talks delivered by the individual authors at 
the conference “Greek Medical Papyri. Text, Context, Hypertext” (Parma, 2-4 
November 2016). Ast is responsible for the part on the DCLP, while Essler is 
behind the sections on Herculaneum and Anagnosis. Both authors read and 
commented on the complete article. 


10verview 


The Digital Corpus of Literary Papyri (DCLP) initiative was launched in 
July 2013 with support from the NEH/DFG Bilateral Digital Programme 
and upon successful completion of a one-year planning grant under the 
same program.206 The project, which ended in August 2017, built the 
necessary framework for a large-scale corpus of literary papyri on the basis 
of infrastructure already in place at www.papyri.info for documentary 
papyri. And by “literary papyri” we mean both canonical literary genres, 
such as epic, lyric, drama, oratory, etc., and so-called para- or subliterary 
texts, whether magical, medical, or school. All content has been encoded in 
the well-established XML-TEI format known as EpiDoc. An attempt has 
been made to customize search, browse, and editing functionality to the 
needs of individuals who deal with literary texts. At the same time we have 
wanted to encourage engagement with all extant papyrological sources, 


both literary and documentary. As a result, users can search across the 
entire corpus of texts in the Duke Databank of Documentary Papyri 
(DDbDP) and DCLP. 

Despite being headquartered in New York and Heidelberg, the DCLP 

has profited from collaboration with a large number of researchers and 
institutions across Europe and the United States. Mark Depauw and Willy 
Clarysse in Leuven shared metadata belonging to the Leuven Database of 
Ancient Books (LDAB) for nearly 15,000 objects. This information 
constitutes the backbone of each DCLP record (Fig. | with metadata from 
LDAB). 
Similarly, cooperation with Duke University’s Duke Collaboratory for 
Classics Computing (DC3) allowed the DCLP team to build on the 
standards and tools in place at Papyri.info. Holger Essler and his team in 
Wiirzburg, together with Gianluca Del Mastro in Naples and Daniel Riafio 
in Madrid, were responsible for the addition of hundreds of files containing 
bibliography, links to sketches, engravings and photographs, and 
transcriptions of Herculaneum papyri. The Parma Medical Project, under 
the leadership of Isabella Andorlinit and Nicola Reggiani, produced 
extended editions of over 200 medical papyri. Furthermore the project 
benefited from the participation of numerous students and scholars who 
have both entered transcriptions and vetted submissions. 

Before detailing what an individual DCLP record might contain and 
what the project seeks to cover over the long term, we will first speak more 
generally about the nature and scope of the initiative. The DCLP is not the 
first large-scale online resource for classical literature. The Thesaurus 
Linguae Graecae and the Perseus Digital Library both offer searchable and 
browsable transcriptions of ancient texts.207 The most obvious difference 
with these initiatives is the fact that DCLP covers only papyrological 
evidence, including papyri, pre-medieval parchment, ostraka, tablets, 
dipinti, and other non-inscriptional evidence. It does not incorporate 
medieval manuscripts, and its interest is not strictly in the ‘text’ per se, but 
rather in the text as it appears on any given material substrate. In this 
respect, it focuses on the inscribed object as a whole, and tries to account 
for extra-textual elements such as layout and non-verbal signs (e.g., 
paragraphoi, diplai, etc.), in addition to the written words.20s As a result, a 
higher premium is placed on accurate decipherment of all elements of the 
textual witness. This is not to say that a photographic representation of 
these elements is offered in the HTML text, but links to photographs are 
provided when possible, so that the user can observe all features of the 
inscribed object. 


We have also tried to make it easy to discover content in the DCLP. 
Text-search functionality is the same as in Papyri.info, as are many of the 
browsing options, and we have retained the faceted browsing capability. In 
addition to finding texts by their publication numbers, provided the 
publications are known to the Checklist of Editions  (http:// 
www.papyri.info/docs/checklist), one can also locate them by TM numbers 
(Fig. 2 with three browsing options). 

The latter is the most effective means of finding a known object, since 
every item has a TM number, but prior knowledge is a prerequisite, and in 
that respect it hardly can be described as a true browsing function. The 
author/works browsing option, on the other hand, represents an entry point 
that should be welcomed by the literatureminded user who wants to see 
how many papyri by a known author survive (Fig. 3). In addition to giving 
the names of extant works by known authors, the system also says when 
there is a Greek text available. Here again, though, DCLP is not charitable 
to ignorance: works by unknown authors do not appear in the list. 

DCLP is not only a portal to information about published papyri, it is also a 
curatorial platform. Using the same editor employed at Papyri.info, the 
system allows users to propose emendations to published texts and to 
contribute introductions, commentaries, translations, as well as updated, 
machine-readable bibliography. All new content is vetted by members of 
the editorial board. The DCLP instance of Berlin P. 12310, an ostrakon 
preserving five verses of Theognis (vv. 434-8) plus an unidentified comedy 
fragment, is an example of an emended text.209 The Theognis verses were 
thought by the original editor to be inferior to those preserved in other 
witnesses, one of which is Plato’s Meno (95e). Even though the first lines 
are transposed and thus differ from the Theognidean manuscript tradition, 
the text is not as banal as the original editor thought. The ordering of the 
lines on the ostrakon agrees with the text found in Plato and the unique and 
incomprehensible reading of mGAww that is printed by the first editor of the 
ostrakon in line 3 has turned out upon closer examination by Julia 
Lougovaya to be a mistaken reading.210 The sherd actually has moAAovc, 
which is attested in all other witnesses. Lougovaya has emended the online 
text accordingly (Fig. 4, http://litpap.info/dclp/62823). 

The Parma medical project has been a significant contributor of new 
content, including extended editions of many papyri. P.Yale II 134 is 
representative of this work.211 The introduction to the online edition, which 
is authored by N. Reggiani, briefly describes the content of the text, which 
is a set of iatromagical prescriptions. This is followed by the edition, 
critical apparatus, and commentary. The header gives bibliographical 


references and information about the date, provenance, physical 
dimensions of the fragments, etc. It also contains a link to an image of the 
papyrus at the host institution. While mostly based on previous printed 
editions, the DCLP record is itself a stand-alone edition. 


2Herculaneum 


The texts of the Herculaneum papyri were also encoded on the basis of 
a reference edition, but provide additional bibliography and links to 
images. Our starting point was the Thesaurus Herculanensium Voluminum, 
the first full-text database of the Herculaneum papyri. It currently contains 
26 texts with some 20k lines that were entered jointly by the Centro 
Internazionale per lo Studio dei Papiri Ercolanesi (CISPE) and the 
Wiirzburg Institute of Classics.212 Since we firmly believe that uniting the 
Herculaneum papyri with the literary papyri on a single platform is an 
important step towards the unity of our discipline and, in particular, 
towards the development of Herculaneum Papyrology, all texts were 
transferred to DCLP and further work was and will be based on this 
database.213 Currently DCLP comprises 117 editions of Herculaneum 
papyri and can be considered fairly complete for this group. 

Since new editions of Herculaneum papyri tend to take decades, it was 
necessary to refrain from reediting or rechecking readings on the original 
papyri. In order to make the texts available as soon as possible we had to 
compromise even further: text entry was strictly based on the most recent 
comprehensive editions. Especially where readings are highly disputed, it 
would have been impossible to decide without thorough study of the whole 
papyrus. Instead, we were striving to provide a complete bibliography of 
editions of Herculaneum papyri. Currently this first-ever complete index 
comprises more than 1,400 records, of which 387 are reference editions, 
the central focus and basis of the digital text. In general, there are four 
categories (see the metadata in Fig. 5). 
a)Reference editions: the bases followed for text entry 
b)Previous editions: any edition predating the reference edition, which may 

be consulted in the future for the apparatus and concordances 
c)Partial editions published after the publication of the reference edition 
d)Readings: individual readings without establishing a _ syntactic 

connection, published after the reference edition 


Although many partial editions provide substantial improvements to the 


text, their incorporation was not feasible at this stage. Besides, the current 
software does not allow to reference parts of text to particular editions. 

The prerequisite for a uniform data structure was the establishment of 
the relationship between the canonical numbers of Trismegistos (http:// 
www.trismegistos.org) and the LDAB and the traditional inventory 
numbers of the Herculaneum papyri (there referred to as Gigante Numbers, 
following the Catalogue edited by him).214 While Trismegistos and LDAB 
assign an identification number to each ancient book or scroll, the 
inventory numbers of the Herculaneum papyri refer to the fragments as 
they were inventoried in the 18t century. Several scrolls were already 
broken in pieces at the time of the excavation, some were cut to facilitate 
unrolling, and every piece, be it a stack of several layers from the outer part 
of the scroll or a part from the central cylinder, was assigned a different 
inventory number. Over the years, these pieces had a story of their own, 
and might now be in very different condition. In addition, difficulty in 
discerning the text on the darkened surface also resulted in miscataloguing 
single fragments. Thus there are several cases where a single inventory 
number contains fragments belonging to different papyri while often a 
single scroll (corresponding to one TM number) has to be reconstructed 
from several inventory numbers.215 In several cases fragments that have 
been edited separately are now proven to belong to the same scroll. An 
example is TM 62499, containing Philodemus De rhetorica II (cf. Fig. 5). 
The scroll can be reconstructed from five inventory numbers, but no 
comprehensive edition is available. We have thus limited ourselves to 
assembling the text from the four reference editions that cover a maximum 
of the text preserved. Since they were published by different editors at 
different times (from 1892 to 1977), it is not surprising they differ 
considerably in scope and method. 

As a consequence of their preservation and the combination of fragments, 
the surviving remains of a Herculaneum papyrus are on average 
considerably larger than those of the Egyptian fragments. They normally 
exceed 600 lines and arrive at a maximum of nearly 4,500 lines for 
Philodemus De musica IV, distributed among more than one hundred 
fragments. Since for many fragments several images are available online, 
the traditional separation of text and metadata as respected in the display of 
Papyri.info resulted in extremely long lists of information without any 
visible connection to the corresponding texts. To overcome this difficulty 
we introduced a system of linking each fragment or column with specific 
metadata by virtue of a corresp attribute (Fig. 6). This allowed us to include 
links to the more than 6,000 images of Herculaneum papyri or their 


apographs that are available on the internet. 

By virtue of the corresp attribute the data about an image is then displayed 
together with the text of the fragment it depicts (Fig. 7). This system is 
followed throughout in order to link a fragment with specific image files, 
whereas Jmages in the header refers to an internet resource (mostly the site 
of a papyrus collection) where images are available. In the screenshot, one 
can see under the title the inventory number of the papyrus and its 
corresponding fragment, as well as information and links to images of 
various types. Furthermore, in fragment 4 there is additional markup at the 
beginning of the line: the lower square brackets mark text supplied on the 
basis of a parallel tradition, which has been drawn on in order to 
supplement parts lost in the original. 

Digitization of the Herculaneum papyri was greatly facilitated by 
collaboration with Daniel Riafio, who is using his software AristarchusX 
for grammatical analysis and author recognition in the Herculaneum 
papyri.216 He transcoded all the texts previously available at the Thesaurus 
Herculanensium Voluminum and many others. His work yielded further 
improvements, such as automatic lemmatization and spell-checking of the 
text. At a first stage we had included lemmata and numbered single words 
in order to allow linking to his software. However, this further level of 
information created problems both to the human reader of the XML and to 
the transformation stylesheets of the Leiden+ converter. We thus decided to 
suppress it for the time being while exploring solutions with stand-off 
markup. 


3Anagnosis 


The Anagnosis project may be considered the first project user of DCLP as 
it natively builds upon the texts encoded there while it also aims at 
expanding and enlarging coverage of ancient authors. Being part of the 
Kallimachos project (http://kallimachos.de), whose objective it is to create 
a regional centre for digital edition and quantitative analysis in Wurzburg, 
Anagnosis is working on image analysis. The aim is to link illustrations and 
transcriptions of papyri or, more precisely and technically, to link the edited 
texts of Greek literary papyri from the full text database of the Digital 
Corpus of Literary Papyri with the digital images of the originals provided 
by the individual collections. 

By default, the link under /Jmages in the metadata taken from LDAB 


leads to the page of the respective repository or collection that provides the 
images. In total, there are some 12,000 digital illustrations available 
covering approximately 10,000 Greek literary papyri (TM numbers). 
However, the distribution is uneven: while no images of many literary 
papyri are available, there can be hundreds of a single Herculaneum 
papyrus, reproducing different fragments and columns, which were taken at 
different times and with different methods. Thus 152 Herculaneum papyri 
are covered by 6,000 illustrations, and more than 3,000 of them are 
engravings from the 18 and 19th centuries. It was partly for the needs of 
and thanks to the data provided by Anagnosis that the new display of links 
to individual images above single fragments was introduced. This makes it 
possible to automatically create a parallel display of image and text (Fig. 
8). 

The output is created entirely from separate digital sources and is loaded ad 
hoc for further processing. The image is retrieved from the server of 
BerlPap, a project of the Berlin Papyrus Collection, whereas the transcript 
comes from DCLP. Thus Anagunosis links the two resources by virtue of the 
TM number and the fragment number, if applicable. As explained above, 
these links are stored in the corresp-attribute in DCLP. 

Some might find already this parallel display useful, but in the 
perspective of Anagnosis this represents just the starting point of image 
analysis, with the ultimate goal being to link each letter of the transcript to 
the corresponding zone in the image. This will allow us to extract and 
display a complete alphabet of the letters present in the papyrus, a tool 
traditionally used for deciding on the reading of damaged areas. Such a 
complete set of images for each letter will further enable automated 
examination of the uniformity of shapes and of the scribe’s consistency. In 
addition, there is the possibility of automated graphic reconstruction to 
evaluate the spacing of proposed supplements. 

The development of the underlying software was carried out at the 
German Research Center for Artificial Intelligence by Saqib Bukhari and 
will be published in specialized journals related to image analysis.217 The 
software will be freely accessible for use and available for download. 
Anagnosis will only produce snippets of single letters and coordinates in a 
stand-off TEI format. Thus by working with the software the user is not 
requested to give away images of original papyri, but he may contribute by 
enlarging the pool of letter snippets that may be used for further research. 

Thus the aim of Anagnosis is not classical OCR on illustrations of 
manuscripts, but exactly the opposite: starting from a line-exact copy, as it 
is present in the Digital Corpus of Literary Papyri, the corresponding letter 


zones within the illustration are referenced to the known letters of the copy. 
The main reason for this is, of course, the many technical difficulties that 
stand in the way of automated text recognition of handwriting and for 
which there is still no satisfactory solution. We therefore use the existing 
material to go a simpler way first. At the same time, Anagnosis works with 
the same components that are essential for character recognition: the 
original image, the mapping vectors and the transcript. The only difference 
is in the direction of the assignment. Anagnosis thus creates a large 
annotated corpus of correct OCR results from papyri, even though these 
results came about through detours. This corpus may then be used in future 
projects as training data for machine learning. 
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Lajos Berkes 


ddbdp.2015.1 = HGV P.Got. 54 (DDbDP 2015 1) = Trismegistos 36154 = gothenburg.apis.45 = p.got.54 


metadata) texi¥) open in editor Canonical URI: http://oapyri.info/ddbdo/ddbdp:2015:1 
HGV cote TM data!) APIS catalog record) transcription) images) transtation¥! 


HGV: P.Got. 54 (DDbDP 2015 1) [source] [xml] 


Title A Letter from Athanasios the Scholastikos Mentioning Constantinople 
Publications P.Got. 54 (DDbDP 2015 1) _[Morein series P.Got. 
P.Got. 54 
Support/Dimensions Papyrus 
Origin unbekannt [More from unbekannt] 
Material Papyrus 
Date 2. Halfte VI- 641 [More from the period between 551 CE and 641 CE] 
Commentary Neuedition bei PN: <http://www papyri.info/ddbdp/p.got;:54dgtl/>. 
Subjects Brief; Athanasios, der Scholastikos, an N.N. 
Images wri.info/apis/gothenburg.apis.44... a 
License (I © Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Agyptens, This work is licensed under a Creative C 
Trismegistos: 36154 [source] 
Publications P. Got. 54 
Inv. no. Gdteborg, Universitetsbibliotek 38 
Date AD 500 - 699 
Language Greek 


Provenance Egypt [found & written’ 
Catalog Record: gothenburg.apis.45 [xml] 


Title Lettre 

Summary Christian letter 

Citations P. Got. 54 deser. [TM 36154] 

Inv. Id P. Got. inv. 38 

Support/Dimensions Pap.; 13x 11em 

Origin Provenance unknown (Province of Egypt) [More from Provenance unknown (Province of Egvot) 

Provenance Papyri Gothoburgenses were, with the assistance of prof. Wilhelm Schubart, acquired partly from a private dealer in Germany in th 
Chronique d'Egypte 13 - 14, 1932, p. 327 

Language Greek 

Date VI-VILA.D. [More from the period between 500 CE and 699 CE] 

Note (general) Location: Gothenburg University Library 

Note (general) Source of description: Recto and verso 

Note (general) Recto: 7 lines against the fibres; Verso: 1 line with the fibres 

Fig. 1: The presentation of metadata. 


Introduction 
A Letter from Athanasios the Scholastikos Mentioning Constantinople 
Rodney Ast - Lajos Berkes 


P.Got. 54 (=P.Got. inv. 38) was published in 1929 by H. Frisk as a descriptum without commentary or 
translation. The address on the back was not transcribed at all. We therefore offer here a full edition of 
the complete papyrus. 


What survives is the upper left part of a business letter measuring 13 (w) x 11 (h) cm. Writing runs 
against the fibers, transversa charta, on the front and along the fibers on the back. Given the fairly 
clean break on the right side of the document, we suppose that the papyrus broke along its central fold, 
meaning that ca. half of each line is lost on the right. This would give us a reconstructed width of 26 em 
for the document, a reasonable height for the original roll from which the sheet came. Blank spaces are 
visible in several lines (see especially 1. 5), apparently employed as punctuation. The script points to a 
date in the late 6th, cf. P.Heid. 3 247 (5 - 25 Nov. 598), or first half of the seventh century, cf. P.Oxy. 16 
1843 (623); the address on the back is in a stylized script also typical of the period. The mention of 
Constantinople in 1. 7 implies that the letter was written before the Arab conquest (641) and not under 
Persian occupation of Egypt (619-629). The papyrus’ provenance may well be the Arsinoite nome, but 
we suggest this only because the majority of the Gothenburg papyri come from the Fayum. There is 
nothing in the text itself that hints at where it was written 


Two people are mentioned, a man called Hypatios (1. 6) and the sender, a certain scholastikos named 


Athanasios (cf. 1. 8n.). The addressee, whose name is no longer extant, was a high-ranking person, gothenburg.apis.45.r 
partiaps ai dine, a2 suggested bythe rataremce toni as.s7nEpoxt| (chign)_ The igh repsstex Of the: word ‘Notice: Each library participating in APIS has its own policy concerning the use and rep| 
points to a context in the upper echelons of the Byzantine administration, and the involvement of a lassi rsitact isle dag Es cleiicn Kreu Wlaita eacny feces ARS oe PAST 


scholastikos fits this scenario. In preparing for his career, Athanasios will have received training in 

rhetoric and at least an elementary law education (see CPR 24 4.4n., with further bibliography). We = 

presume, therefore, that he was either a professional lawyer or a high-ranking administration official. HGV 36154 Translation (Englisch) [xml] 
r 


In keeping with the high register of the document, there is a reference in line 7 to the city of z 
Constantinople in the phrase kata tm v Baovkiba tav 1[dAewv. Designation of the city as i Bacv ¢ 


tav 76Aeav is found in other documents, see, e.g., PS1176.5 (6 May 572 or 573) and P.Firene 3.15.3 # The dispute about 

(end half of 6th c.), and there are numerous examples in late authors, as a search in TLG of kata” tT) V ...on the 2oth of the month of Choiak to the... 

BuousiSce tav ndAewv demonstrates. Although he considered Baowiéa, H. Frisk opted in the ed.pr. to. [your] eminence the judgments(?) and . . . 

print kata” ty v, vida tev v[; he abandoned the reading presumably because the first letter of the set in motion in sum ... 

word does not look like a typical beta. Dieter Hagedorn, however, also proposed kata ™ v BaowiiSa to state; but since God has granted .. . 

tv n[dhewv, stating in an email that "Beta hat in der Spatzeit oft ein abenteuerliches Aussehen." We the proper procedure against(?) Hypatios the gloriosissimus(?). .. 


2: The layout of the edition: introduction, image, and translation. 


F 1a GupioBntndévta évexev T[ -ca.?- ] 
TA cikGdi TOO Xoiak yN[v]dc eic tv -ca.?- ] 
UTrepoxnA(?) Tag S[i}oAiweic(*) kai tof -ca.?- ] 
KeivnBévtac(*) év TH Keporaiw T[ -ca.?- ] 

5  onpavai, vac. ? TAnV Beo0 Se6wKOTO[¢ -ca.?- ] 
TH¢ peBodiac(*) Kad ‘Ytratiou Tob év[ -ca.?- ] 
éxTrépwar Kare Tv PaciAida Tv TOAEwv -ca.?- ] 


v 
[ -ca.?- ] tT ASavaoiog avy B(e@) oXOA(oTIKOC) UpéTep(oc) 
Apparatus 
41.3. or UTtepoxfi 
41.3. 1, OfijaAnwers 
41.4. |. KivnGévtac 
Ar.6. |. peBodeiag 
Notes 
PX 


Cf. D. Hagedorn, APF 59 (2013) 123-137, esp. 127-136. 


bmepoyn) (or bxepoxf}) was probably proceeded by 7 suetépa in the previous line, either in the 
nominative or dative case, see, e.g., CPR 14 9.10 (Arsinoite, July - Nov. 607); P.Miinch. 3.1 79.5 (prov. 
unknown, 6th/7th c.); P. Oxy. 63 4397.192, 219 (March 17, 545). The word refers to high-ranking 
persons, usually duces, cf. J. Gascou, [Review of] P.Mich. XIII, CdE 52 (1977) 360-368, at p. 363, and 
G. Agosti, AnalPap 10-11 (1998-1999) 7-15, at p. 13. Instead of iaArwere, Siarciere (“intervals”) 
might have been intended. We prefer the former, however, because it is more consistent with the 
prevailing theme of legal procedures (see intro above). 


: The layout of the edition: transcription, apparatus, and commentary. 


Editorial History; All History; (detailed) 

2016-03-10T10:16:41-05:00 [james.cowey]: Finalized - Ready. 
2016-03-10T09:57:36-05:00 [james.cowey]: Vote - Accept-Straight-to-Finalization - 
Fine 

2016-03-10T09:55:57-05:00 [james.cowey]: Submit - For renaming. 
2015-12-16T09:23:00-05:00 [simoeis]: Finalized - Ready 

2015-12-16T09:19:11-05:00 [simoeis]: Vote - Accept-Straight-to-Finalization - Fine 
2015-12-15T10:55:21-05:00 [simoeis]: Submit - listBibl has been removed 
2015-06-11T08:55:16.459-04:00 [james.cowey]: Finalized - Ready. This edition has 
undergone the peer review process having been vetted by two senior editors. Their 
suggestions and corrections have been taken into account. 
2015-06-11T08:55:16.451-04:00 [james.cowey]: Vote - Accept-Straight-to- 
Finalization - Fine. 

2015-06-11T08:46:09.179-04:00 [james.cowey]: Resubmitting an edition prepared by 
R. Ast and L. Berkes, because problems with finalization had arisen 
2015-05-28T08:30:40.46-04:00 [berkes.lajos]: We are submitting a full edition of 
this papyrus, including revised text, HGV data, commentary, and translation. We have 
taken into account suggestions made by the readers. 


4: The editorial history of DDbDP 2015 1. 
Perspectives and Challenges in Editing 
Documentary Papyri Online 


Fig. 


Fig. 


A Report on Born-Digital Editions through Papyri.info 


1Text editions at Papyri.info 


This contribution reports on the digital publication of Greek documentary 
papyri through the platform Papyri.info: 


Papyri.info has two primary components. The Papyrological Navigator (PN) 
supports searching, browsing, and aggregation of ancient papyrological 
documents and related materials; the Papyrological Editor (PE) enables multi- 
author, version controlled, peer reviewed scholarly curation of papyrological 
texts, translations, commentary, scholarly metadata, institutional catalog 
records, bibliography, and images.218 


The PE allows registered users to contribute content to the database. The 
submissions are then vetted by members of the editorial board. These 
contributions vary from correcting typos or entering translations to 
proposing new readings in already published texts. After this feature was 
introduced, the submission of corrections led quickly to the realization that 
in some cases extensive corrections may well justify reeditions of texts. 
This was a further impetus to experiment with accommodating born-digital 
editions of documentary papyri at Papyri.info.219 

Six papyri have been published so far in Papyri.info and some more 
will appear soon. The first edition dates back to 2013, when Nikos Litinas 
transcribed the back of P.Corn. 34 via Papyri.info. Subsequently and after 
much consideration, the editorial board of Papyri.info decided to present 
his readings not as corrections, but as a proper online edition.220 In 2014, 
the idea of exploring this issue in the framework of a seminar at the 
University of Heidelberg arose. In a class on digital papyrology offered in 
the summer semester 2014 by Rodney Ast, James M.S. Cowey, and myself 
several unpublished papyri were discussed and prepared for a digital 
edition. These were Gothenburg papyri that were only described in H. 
Frisk’s 1929 publication (P.Got.). The editions were peer-reviewed by at 
least two specialists in the field and appeared exclusively in Papyri.info. 
The three texts were fragmentary letters from the VI-VII c. AD.221 The 
publication of these documents was announced in the Bulletin of Online 
Emendations to Papyri 5.1 (January 7, 2016).222 More recently, two 
ostraka were republished in Papyri.info.223 The descriptum O.Did. 37 was 
emended by Héléne Cuvigny to an extent that justified a new edition, 
although no introduction or line-by-line commentary was added.224 


O.Berenike II 237 was republished with introduction and commentary by 
Roger Bagnall and Rodney Ast.225 

Work on these editions was a challenging task and not only for the 
usual scholarly reasons. Although using databases has very much become 
an indispensable tool for papyrologists, producing an edition which appears 
exclusively online but conforms to the well-established form of 
papyrological editions turned out to be quite difficult. In what follows, I 
will outline the main problems we faced during this process and present the 
preliminary solutions that we have been able to come up with. The form of 
online editions of documentary papyri is far from final: which conventions 
and solutions will be adopted depends very much on the reactions of the 
community. In this spirit, this contributions aims also at encouraging 
further discussion of online editions in the papyrological community. 


2An example: DDbDP 2015 1 


Let us have a look at one of the editions, DDbDP 2015 1226 (= P.Got. 54 
descr.). The layout of this edition is essentially the same as in the case of 
already published texts in DDbDP. The only major difference is the 
presence of an extensive introduction and commentary.227 The edition 
begins with a summary of the HGV, TM and APIS metadata.22s Next 
follows the introduction, the image and the translation in a convenient 
layout.229 The edition ends with the text and apparatus and notes to 
individual lines (see Figg. 1-2-3). 

This structure preserves the traditional, well-established format of 
papyrological editions, but offers some distinct practical advantages. 
Scholarly work — even in the humanities — has been heavily relying on 
digital resources over the past decades: research is basically unthinkable 
without the extensive use of databases, journals and books available online. 
This is especially true for papyrology, since the field offers a wide range of 
excellent and indispensable databases for its practitioners. Furthermore, 
engaging with the discipline often requires dealing with scattered 
information: where to find an image of the papyrus? Which textual 
corrections or interpretative suggestions have been made to the ancient 
texts one is working on? Finding this information is relatively easy in the 
very well organized field of Papyrology, but after finding the references, 
scholars often end up searching other databases for the referenced images, 
texts, articles, and books online.230 


The digital editions offered in the DDbDP clearly faciliate this process. 
The (downloadable and zoomable) image appears next to the papyrus, 
which renders checking the readings very easy.231 Metadata and links to the 
relevant entries of other important databases in the field can be found in the 
introduction, while an effort has been made to give direct links to all 
papyrological texts in the DDbDP and the referenced articles,232 when an 
online version is available — which is increasingly the case. The reader gets 
all the information, which he or she would otherwise collect from different 
sources, in one package. 


3Technical challenges 


I have tried to demonstrate the advantages of digital editions by using the 
example of DDbDP 2015 1. However, it has to be said that producing these 
editions was a difficult undertaking often hampered by technical 
challenges. Sometimes the display does not show the result one would 
expect even though the XML markup is correct. There were numerous 
problems in laying out the texts, which often required ad hoc solutions. An 
example are raised letters which are often used to designate second 
editions. However, these cannot be used in the commentary or the 
introduction: they have to be represented in other ways. A more serious 
problem is the case of PDF files: one of the most important features, which 
the scholarly community would expect, is that online editions can be 
converted into downloadable PDF files, i.e. print versions. This way digital 
editions could be accessible in a more traditional, tangible form. It seems, 
however, that this is technically a much more complex issue than one 
would expect. There is unfortunately no easy way at present to transform 
the digital editions made at DDbDP into PDF files. 

There are also limitations on encoding those features in Leident/XML 
that were already difficult to handle during the digitization of print 
publications. It is very difficult, for example, to reproduce the layout of the 
papyri at present. This is not that visible in the case of papyri published in 
the DDbDP so far, but there would be certain cases (e.g. accounts with 
specific layout) that would be very difficult to reproduce. It is also 
impossible at the moment to represent abbreviations in the apparatus, as in 
printed editions. The abbreviations of the address in DDbDP 2015 1 ( ] ft 
AOavdoiog obv O(E) oyoA(aotiKOs) DuETEP(os) ) would have been 
represented in the apparatus of a printed edition in the following manner: 


o®*vvoyx-ovpetep ~ pap. Whether one includes this in the apparatus is very 
much up to editorial traditions and preferences; one may argue, in fact, that 
the presence of a high quality image of the papyrus renders detailed 
approximations of abbreviations superfluous. However, a digital edition 
could offer more in this case: it would be certainly possible to link the 
abbreviations in the text directly to the image. 233 Even individual letters of 
the transcription could be matched with the image and thus online editions 
could become an excellent tool for self-study.234 

These technical issues have imposed some limitations on the editions 
published in the DDbDP, but there is no doubt that all these problems could 
be solved. If such digital editions are be accepted by papyrologists, it will 
be only a question of time and money to create an editorial platform at 
Papyri.info that can serve all the needs of traditional editions. The question 
is rather where the priorities of papyrologists lie, or in other words: is this 
worth the effort in a field with such limited resources? Another question 
may be asked at this point as well: should we expect online editions to 
conform to the norms of traditional printed editions or should we accept 
them as a slightly different form of publication? 


4Referencing digital editions 


One of the most difficult problems we faced after creating the editions was 
their naming and referencing. There are two major issues here: 1. How 
should these editions be integrated in the standard reference system of 
papyrology, Checklist of Editions of Greek, Latin, Demotic, and Coptic 
Papyri, Ostraca, and Tablets (hereafter Checklist)?235 2. Since these texts 
are part of Papyri.info, their text, introduction and commentary can be 
emended and updated by users and these improvements may change the 
‘original’ edition. This would imply that these editions have no stable form, 
but are on some level fluid.236 This ‘fluidity’ obviously creates difficulties 
in referencing: how can this problem be dealt with? 

Both issues have created significant discussion among editors of the 
DDbDP and the solutions are preliminary. In what follows, I will outline 
the main lines of thinking behind the solutions which have been 
implemented so far. Since editors themselves disagree on some of these 
questions, it has been decided to wait for the reactions of the papyrological 
community and continue developing the form of the editions based on its 
reactions. 


The papyri which have been published in the DDbDP so far were all 
described before and had a corresponding reference in the Checklist. This 
made things easier in a way, since we could have opted for slightly 
modifying the already existing references of these papyri, but at the same 
time complicated issues even more, since we did not want to introduce 
anomalies in the Checklist. Furthermore, we also had to consider a 
reference system for papyri that did not have a Chechlist-identifier yet, so 
that it could be used for further publications. 

At first sight, these publications represent the same case as papyri 
published in journals. They could have been referenced in a similar fashion 
and then included in the Sammelbuch griechischer Urkunden aus Agypten, 
as usual. However, an important argument came up quickly: why should we 
double the effort, if the texts printed in the Sammelbuch are reentered into 
the DDbDP again? This is especially true considering that it is inevitable in 
the long run that the Sammelbuch will be published digitally (only?). It was 
also important to indicate the date of publications, since this way these 
editions become comparable to journal publications. Taking all these 
considerations into account it was decided to use the identifier DDbDP 
year number, e.g. DDbDP 2015 1: this identifier represents a collection of 
digital-only publications.237 The numbering follows the sequence in which 
the submitted publications were included in the DDbDP. This creates a 
clear and transparent system that enables straightforward referencing of 
editions born at Papyri.info. These editions are regularly announced 
through the Bulletin of Online Emendations to Papyri (BOEP)238 in order 
to make their existence known to the papyrological community. 

The other issue is what I have called the ‘fluidity’ of these texts. 
Papyri.info enables editing the text, introduction, and commentary — 
essentially any part of these digital editions. This leads to obvious problems 
in referencing them. For instance, let us say someone refers in an article 
published in 2018 to the introduction of DDbDP 2015 1. However, in 2019 
a user of the DDbDP proposes a change to the introduction of DDbDP 
2015 1 that affects exactly the part of the text which was referred to in 
2018. His or her suggestion is reviewed and accepted by the editorial board 
of the DDbDP and subsequently replaces the text referred to in the 2018 
article. If someone checks the 2018 article in 2020 and wants to look up the 
reference, he or she will not find it. The information that the introduction 
has changed would be available in the editorial history of DDbDP 2015 1, 
but this is not an obvious or user-friendly place to look for this information. 

This is a problem at the present, even if it is only a theoretical one, 
since no changes have been made to these editions so far. The editorial 


board of the DDbDP has debated how to deal with such scenarios, but there 
are basically two solutions. The more traditional way is to track the 
different versions of these editions. This means that in theory something 
like DDbDP 2015 1 (2), 1 (3) etc. would come into existence each time the 
edition would change. In an ideal world the user would be able to switch 
between these versions with the differences being highlighted. However, 
this is impossible to do presently. 

Another way of thinking would be to accept these digital editions as a 
new form of scholarly publication that is not as stable as the traditional 
ones. This would imply that scholars would need to get used to the idea that 
the texts they find online can change anytime and that they need to check 
them each time before quoting them and always refer to them with a date of 
access. This approach has certainly some appeal, as it creates a faster, more 
direct way to do scholarly work, but there are also caveat-s. This method 
may lead to chaotic references and a certain lack of transparency. However, 
we also have to accept that at some level this is already becoming the 
reality of scholarly work. Discussing drafts publicly on an online platform 
(e.g. at https://www.academia.edu) has become increasingly common even 
in the humanities (this practice is much more widespread in the sciences). 
We all refer in papyrology to unpublished documents, drafts, personal 
communications of colleagues: in the past, even communications “per 
litteram” were normally cited and accepted as scholarly references. These 
are in a way also messy references, since they cannot be easily checked or 
verified, especially since some of this information never becomes publicly 
available. If we want to fully exploit the possibilities of digital publications, 
we may need to accept this ‘fluidity’. 

Finally, I would like to mention a further minor problem in referencing 
the text: there is no traditional page numbering. In my view, however, this 
is not a real problem. First of all it is very common in Papyrology to refer 
only to introductions or commentaries of editions without mentioning the 
page number. But what is more important: references can be very quickly 
found in an online environment through the search function of the browser. 
Even though some references to online editions may seem vague at first, it 
is very easy to deal with them practically. 


5Perspectives of digital editions 


I have tried to demonstrate the main advantages and the accompanying 


problems of digital editions at Papyri.info. As I have emphasized: it is still 
an open question whether this format will be accepted and developed. If it 
is accepted, it would certainly represent a new, more fluid form of textual 
editions in our field. If we try to look at the bigger picture, this model 
offers some distinct advantages beside the practical ones I have mentioned 
so far. Publication through this platform is open-access, peer-reviewed, and 
fast. Once an edition has been properly vetted, it can be published without 
further delay. This platform may also offer the possibility to quickly 
describe or publish smaller fragments that could swiftly enter DDbDP this 
way and thus become searchable. This of course should not imply that 
Papyri.info limits itself to the publication of small pieces that would 
otherwise be ignored. 

There are also some problems with this model. One issue is the 
appreciation of online, open-access publications in our field and the 
humanities more generally. Even though most scholars agree that such 
publications are desirable and represent the future, they are still not really 
valued. If someone were to decide to publish a papyrus in a well- 
established journal or at DDbDP, it is pretty clear that the former possibility 
is much better for one’s CV, even if submissions to the DDbDP go through 
a peer-review process. An additional problem in this respect is that while in 
an article one can bundle several papyri together, this is not possible at 
present at Papyri.info.239 Only time can tell how fast the attitude towards 
digital publications will change in our field. However, it is pretty likely that 
this will happen, as has been the case in the sciences for quite some time. 

Another problem is the issue of limited resources in our field. 
Papyri.info is indispensable at the moment, but is struggling to keep up 
with new publications, BL entries and other corrections, etc.240 So the 
question is: what are our priorities? The focus of the editorial board of 
Papyri.info has been very pragmatic: making as many new texts as possible 
available. This has led to certain compromises. For instance, alternative 
readings are often not entered during the entry stage and many (BL and 
other) corrections are still missing. We believe that it is better to have more 
material online (even with some infelicities) than to stick to a more limited, 
but also more flawless corpus. 

Focusing more on publishing papyri on Papyri.info would certainly 
also require an effort from the papyrological community: scholars would 
need to vet the incoming submissions and be open to treating editions at 
Papyri.info the same way as they would treat printed publications. The 
online publication does not have to stop with documentary texts; the 
existence of the Digital Corpus of Literary Papyri (DCLP) at http:// 


www.litpap.info could also open the door to editing literary papyri using 
essentially the same platform.241 Languages do not have to be a limit either: 
at present it is possible to publish Latin, Greek, and Coptic papyri at 
Papyri.info and the language of publication is not restricted to English.242 
The potential is certainly there, but priorities need to be set. 

I believe that the only way to find out whether Papyri.info could work 
as a platform for editing papyri is to give it more practice. We need to see 
whether this kind of edition and reference system works for Papyrology or 
not. It may turn out at the end that some (inevitable) chaos in referencing 
these texts is outweighed by the advantages that this kind of direct 
scholarly work provides; on the other hand, it is also possible that 
papyrologists decide to stick with more traditional forms of publication or 
prefer other digital options. However, to assess this we would need more 
online publications. At moment, anybody can submit a papyrus for 
publication at Papyri.info. It is hoped that this article will encourage 
scholars to explore the possibilities of editing papyri on this platform or at 
least to open a dialogue about its advantages and disadvantages. 
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Massimo Magnani 


The Other Side of the River 


Digital Editions of Ancient Greek Texts Involving Papyrus Witnesses 


1Introduction 


More than forty years ago, on the footsteps of Dom Froger (1968), WEST 
1973, 70-2 asked himself for which editorial task the use of the computer 
could have helped. After having discarded the automatic collation,243 West 
imagined that a computer, provided with the transcriptions of the 
manuscripts “purged of coincidental errors”, could have drawn up “a 
clumsy and unselective critical apparatus”. Then, if contaminations have 
been out of question, this computer could have worked out “an ‘unoriented’ 
stemma” by comparing the variants, but it could not have been able to 
choose the correct orientation of the stemma, an operation possible only 
“by evaluating the quality of the variants”.244 Thereafter, it has been 
assumed that the computer might be useful even for an_ heavily 
contaminated paradosis: the late Bryan Hainsworth, presenting in short the 
manuscript tradition of the Odyssey, imagined that a computer could 
establish the degree of contamination of every single family of the 
Odyssey’s paradosis, but in his opinion the result would not be 
commensurate with the amount of work.245 After years, what is the 
situation, with reference to the ancient Greek literature and texts? If we 
apply to the term ‘edition’ the usual scholarly meaning, i.e. ‘edition of a 
text based on the method(s) of textual criticism’, and we expect that the 
expression ‘digital edition’ can refer, among the variety of the editorial 
products, not only to the more or less refined digitization of old or new 
‘traditional’ editions of ancient Greek texts,246 but also to the ‘edition of a 
text based on a new digital method of textual criticism’, we are destined to 
disappointment, at least for the time being, and not surprisingly. 


2‘Digital editions’ of classical texts: an overview 


The Catalogue of Digital Editions, the most complete “attempt to survey 
and identify best practice in the field of digital scholarly editing”,247 
gathers 256 ‘digital editions’, among which 12 edition of ancient Greek 
texts (30.10.2017).248 In fact, none of them is a new edition based on 
textual criticism applied to a multi-manuscript tradition: 11 editions are 
catalogued as ‘digital scholarly editions’;249 six of them are editions of a 
text transmitted by unique witness (4 are editions of epigraphic 
collections,250 two involve the text transmitted either by a single2s1 or by a 
peculiar and very valuable ancient manuscript252), two are the digitization 
of the standard, ‘traditional’ editions of the Old and New Testament in 
Greek language (ID 163 = LXX Septuaginta — http://septuaginta.net;73 = 
Digital Nestle-Aland — http://nestlealand.uni-muenster.de), one is the 
edition that gathers the electronic editions of the Gospel according to St. 
John in Greek, Latin, Syriac and Coptic UD 58 = http:// 
www.iohannes.com). Finally, I will not include the Digital Athenaeus 
among the digital scholarly editions, since it is the digitization of Kaibel’s 
text (with tools).253 The same situation has been recently underlined by 
Ermanno Malaspina with reference not only to the ancient Greek domain 
but also to the philological studies in general.254 An exception, once 
finalized, should be his project of digital edition of the Ciceronian Lucullus 
(in collaboration with MeDiHum, Turin University, e IAS, Durham 
University): a ‘Lachmannian’ critical edition of a classical text transmitted 
by 74 mss. arranged in a closed recension, a sort of ‘crash test’ of the DH 
resources, and most of all of the TEI encoding.255 The final result will be a 
web page with critical text and apparatus, with the possibility to open 
windows with the text of individual manuscripts or editions in 
correspondence with their variant readings of ecdotic significance. To mark 
a difference from the mere digitization of a ‘traditional’ critical edition, the 
apparatus of this Zucullus online aims to be much richer in data without 
yielding to the principles of genetic philology, but offering all that is 
relevant for the tradition of the text. In 2015 Malaspina performed a 
transcript from Word to XML through Oxygen of the readings of the 74 
manuscripts and of some printed editions limited to 4 text-paragraphs. By 
following the TEI guidelines (ch. 12), each variant reading was tagged 
(<rdg>) referring through the @wit attribute to the list of the mss., and 
through one of the four @type attributes to its ecdotic classification (in 
order of increasing relevance: orthographical variants, polygenetic variants, 
variants significant for the constitution of the text, variants significant for 
establishing the 

stemma codicum). The JavaScript prototype, performed by Peter Heslin 


(Diogenes’ inventor), allows for the data display and the control of the 
tagging errors; the last stage will be the synoptic display of the text of each 
witness. 

The lack of “‘comprehensively digital’ scholarly edition of a ‘classical’ 
text with a manuscript-based multi-testimonial tradition” was noticed by 
MONELLA 2012.256 According to the responses given to his cognitive 
survey on Academia.edu and Digital Classicist, the main reason should be 
“time and money”, but Monella believes that the real reason is the lack of 
need. Digital scholarly editions (DSE) are favourite scholarly products of 
codicologists, epigraphists, papyrologists, editors of documentary 
manuscripts and palaeographers, says Monella, because they are focused on 
documents, and by 


‘genetic’ editors of modern and contemporary texts [...]257 and historical 
linguists, who may study the evolution of language and orthography through 
‘errors’ in inscriptions, in manuscripts and in modern print materials throughout 
the centuries, 


because their concern is the textual variance. Classical editors who are 
dealing with texts transmitted by a “manuscript-based multi-testimonial 
tradition” are dealing with ‘canonized’ ancient texts, where textual variance 
is due — or credited to be due — for the most part to the erroneous medieval 
paradosis and not to the author. ‘Errors’ are identified and collected only 
for establishing the stemma codicum and the text itself. Finally, continues 
Monella, for classical editors the manuscript is important especially as 
‘textual vehicle’ and does not bear a particular relevance in itself. 
Therefore, e.g., why should the Aeneid’s editor digitize even a limited part 
of this manuscript tradition? The purpose should not be the expansion of 
the critical apparatuses with the inclusion of more data — usually, this 
purpose is disregarded by the ‘traditional’ editor because of their editorial 
irrelevance, but “a plural, fluid concept of text, a concept implying that 
each document’s text is worth studying as a historically determined cultural 
object”, and the increasing interest in ““post-classical’ Latin and Greek — 
thus joining forces with historical linguists and romance philologists”. In 
my opinion, it is not completely true that the ‘traditional’ classical editors 
and philologists are only interested in manuscripts as witnesses of the text: 
on the contrary, we see an ever-growing attention to the ancient and 
medieval manuscripts as essential witnesses of the cultural reception of the 
related texts. It is acceptable that the digital scholarly editions can provide 
the best framework, in order to manage and display the rich data complex 
derived by a multi-manuscript paradosis in its entirety (see also below). 


Certainly, this approach can be of great cultural importance and can 
stimulate the discipline to reconsider its goals and methodology;25s 
moreover, the IT solutions devised in response to the problems associated 
with digital editions constitute an advancement and it will contribute to 
create new skills and new professional figures, e.g. the ‘computational 
scholars’, that is “philologists skilled in both classical philology and 
computer science”’.259 


3Case study 1: Mastronarde’s Scholia Euripidea 


Before trying a provisional conclusion, I would like to review with some 
detail two digital scholarly editions. Among the aforementioned editions, 
the only one that is really a new edition, even though based on the 
‘traditional’ method(s) of textual criticism, are Mastronarde’s Scholia 
Euripidea (supra, n. 7), aiming to supersede the standard work of 
SCHWARTZ 1887—91.260 As Mastronarde writes, “the goals of this project 
are quite traditional in a philological sense, but also experimental and 
forward-looking in terms of format”. This view is very instructive. On the 
one hand, Mastronarde did not use the computer resources, in order to 
review the collations made by the previous editors,261 to “clarify the extent, 
nature, and possible stemmatic relationships of the scholia in some of the 
so-called recentiores”, or to put a better order to the scholiastic corpora of 
the Palaeologan era (Maximus Planudes, Manuel Moschopoulos, Thomas 
Magister, Demetrius Triclinius). On the other hand, his choice for an open- 
access digital format responds to specified intellectual and educational 
needs, apart other very sensible economic, professional and scientific 
reasons: “a digital format is variable [...], updatable, [...], allows for 
sharing of interim stages of the work, [...] is expandable, [...] searchable in 
a way that a printed volume is not”. The project is so far limited to the first 
500 lines of Euripides’ Orestes and to 20 “witnesses of various kinds” of it; 
the play was selected by Mastronarde first because, 


as a triad play, it provides the maximum degree of variety and complexity in the 
annotation tradition and therefore forces one to confront most or all of the issues 
that may arise as the work proceeds”, 


then for the availability of images. The problems of conversion of a 
‘traditional’ critical edition to XML/TEI format have been so relevant, that 
only Orestes 1-25 and 401-25 have been published, with the Triclinian 


metrical scholia to the parodos and the prefatory material.262 

Mastronarde created four levels based on the TEI division-type 
element:263 the div1 element, the largest one, includes all the material 
related to the correspondent play, including one or two div2 elements, 
containing the introductory texts and the scholia. div3 always has three 
required attributes and occasionally an optional fourth one; the first two 
give a complete and expandable classification of the scholia (“@type = vet, 
rec, mosch, thom, tri, plan, vetMosch, vetThom, vetMoschThom, 
recMosch, recThom, recMoschThom, moschThom; @Wsubtype = exeg, 
gloss, paraphr, gram, artGloss [‘‘a gloss that consists only of the article 
agreeing with the glossed word” ], etaGloss [“an eta placed over a Doric 
alpha in a lyric passage to indicate the normal form”]”). The third one 
required attribute is the @xml-id of the play. The div4 elements are the 
“children of each scholion div3”, the only one of these being mandatory is 
the one including the text of a single scholion with its lemma and its 
witness list (@type of ‘schText’). One of the main problems has been the 
impossibility “to key an apparatus item to a line number’, problem that can 
not be easily overcome, considering that “anchoring each apparatus item to 
a single word or phrase is possible, but the markup would be far too time- 
consuming and in my opinion out of proportion to any possible gain”. After 
the text of the scholion, “a required seg with @type of ‘witnesses’ contains 
the sigla of the manuscripts that contain the scholion”, then follow seven 
(or less) other kinds of div4: engTrans (only for a choice of scholia), 
lemmaPosNote (“details about variations in the lemma, the presence of 
reference symbols linking the scholion to the text, and the position of the 
note”), appCrit and appCrit2 (the second critical apparatus, “for 
orthographica and other minor matters”), and commentSim (commentary 
and similia). Interesting is the creation of two tags systematically 
identifying two types of information, that in the ‘traditional’ editions are 
rare and scattered among the mss. readings: collNotes (“collation notes, 
record of difficulties in reading images, of divergences from previous 
reports, and reminders to check the original or a better image at some 
future date”) and keywords, in fact a reminder “for additional description 
of the content in aid of searching or indexing at a later point”. Both of them 
are not publicly displayed, but reserved to the author and collaborators for 
future work. Another interesting piece of information, again not always 
systematically recorded in the traditional critical apparatuses, is the 
location of the scholia (above the line, marginal, or intermarginal), the 
variation of their sequence, and the indication of the point where a scholion 
begins and the other ends. Mastronarde’s choice is due to the difficulty of 


using superscripts in XML, therefore reserved to indicate different hands or 
“different versions of the same note at different locations in the same 
witness”. That Mastronarde’s edition is traditional and digital only for 
having chosen this format of displaying the textual contents is also clear 
from the treatment of the metrical scholia, limited to Triclinius’ scholia on 
the parodos of Orestes. As Mastronarde’s underlines, by assigning a 
different tagging to the metrical scholia, XML allows to display the 
metrical scheme and the text of Triclinius’ mss. side by side with the 
scholion, while DE FAVERI 2002 had to publish them separately, at the back 
of the book. 

Leaving aside, of course, the differences due to the progress of the 
studies on the manuscript tradition, a limited comparision, restricted to the 
incipit of the play (and to the vetera set of scholia), between Mastronarde’s 
edition (“full view” mode of display) and the standard edition of Schwartz 
is instructive:264 the digital edition shows five scholia vetera to Or. 1 (28 
overall), all of which transmitted only by V (= A in Schwartz),265 while 
Schwartz prints only three of them, with the first and the third ‘tagged’ 
with a peculiar [dia]critical sign, a crux indicating not editorial desperation 
but the recent origins of those scholia26s (all of them are defined as vetera 
exegetica in Mastronarde’s edition). The text of the three scholia in 
common does not differ; Mastronarde’s apparatus is richer, is preceded by 
the English translation and has additional information (variations in the 
lemma, the presence of reference symbols, orthographical matters, and a 
brief commentary). The convenience of the digital format depends on the 
aforementioned statement (variability, upgradability, accessibility of 
interim stages of the work, expandability, searchability), but this edition did 
not benefit of digital tools neither for the mss. collation nor for the 
arrangement of the ms. witnesses in an open or closed stemma designed for 
every single scholiastic corpus.267 Its apparatus has been created by 
Mastronarde and is not the result of the information recovery based on the 
computer assisted collation of digital transcriptions. 


4Case study 2: the Homer Multitext Project 


Returning to the initial question, Harris’ warning26s can be partially shared, 
but to my opinion the problem is not so much that the ordinary reader has 
no desire to loose him-/herself in the maze of variants that every 
philological operation inevitably generates, but that the concern about 


XML/TEI and its editorial application risks to make us lose sight of the 
final objective of every philological operation: that of establishing a text. 
On the other hand, I completely agree with him that if the application of the 
IT to the philology, precisely because it allows multiple choices, is 
transformed into the abdication of choice, this application is improper, 
because the job of the philologist is to choose. In a sense, the most 
ambitious project of digital scholarly edition is the one that has chosen to 
completely embrace that risk, not simply in order to avoid the choice but 
denying its methodological correctness. With the words of its editors, C. 
Dué and M. Ebbott, 


the Homer Multitext Project seeks to present the textual transmission of the 
Iliad and Odyssey in a historical framework. Such a framework is needed to 
account for the full reality of a complex medium of oral performance that 
underwent many changes over a long period of time. These changes, as 
reflected in the many texts of Homer, need to be understood in their many 
different historical contexts. The Homer Multitext provides ways to view these 
contexts both synchronically and diachronically.269 


Therefore, the Homer Multitext Project offers “free access to a library of 
texts and images, a machine-interface to that library and its indices, and 
tools to allow readers to discover and engage with the Homeric tradition”. 
Editors (and co-editors: D. Frame, L. Muellner, G. Nagy) reject the 
traditional approach for //iad and Odyssey and the possibility of a critical 
edition of the poems establishing “an original text as it supposedly existed 
at the time and place of its origin’”’.270 Persuaded by the idea of a Homeric 
tradition always evolving from the pre-classical age to the Byzantine era, 
the Homer Multitext editors believe that the textual variants even of the 
medieval paradosis are the sign of the variation inherent to the system of 
oral poetry and that even when the poems were finally written down, they 
continued to be performed orally.271 The consequence is that //iad and 
Odyssey have never been fixed texts. Therefore, the Homer Multitext 
Project is gradually editing the Homeric witnesses (papyri, medieval 
manuscripts, ancient quotations), considering each manuscript and each 
textual variation as the valuable testimony of the oral tradition. We are not 
told if all the manuscripts will be digitized: six medieval Iliadic codices of 
well known relevance have been completely or partially digitized, not 
always successfully,272 and only one papyrus.273 That is, a work in 
progress. The scientific foundation of it was placed by two ‘traditional’ 
publications: “a multitext edition with essays and commentary” of //. X and 
an overview of the Ptolemaic papyri as witnessing the multitextuality.274 


This is not the place where to discuss in detail the position of Dué, Ebbott 
and Gregory Nagy on the Homeric tradition; nevertheless, this position is 
the essential reason for their peculiar digital edition and therefore deserves 
a short presentation (and some considerations). Bird’s very compact 
monograph aims to be a general introduction to textual criticism applied 
first to classical and biblical texts (pp. 1-26), then to Homer in general (pp. 
27-60), finally, to the Ptolemaic papyri of Homer (pp. 61—100), interpreted 
as the evidence not of the “eccentricity”’275 but of the everlasting 
“multitextuality” of the ancient Homeric tradition. Bird defines as 
“authentic” and “original” all 

the variant readings that seem Homeric both by nature and by ancestry, but 
this judgment will not automatically define the other variants as “spurious” 
or “unoriginal”: in others words, any variant reading that is not an obvious 
copyist's intervention must be considered authentic in principle. 
Synthetically, when two variants have internal and external factors on their 
side that prove or suggest authenticity, the Homer editor must refuse the 
traditional way of thinking of the philologist that would lead to make a 
choice between the two: “there is no need to choose one reading and reject 
the other”.276 This way of operating is programmatically opposed to the 
Lachmannian critical setting, also from the point of view of data 
presentation (i.e. a traditional critical edition that provides a main text at 
the top of the page, accompanied by a presentation of variant readings in 
the critical apparatus at the end of the page). For this reason, Bird proposed 
a new layout presenting the variant readings at the same time without 
resorting to the critical apparatus. The purpose is to print multiple versions 
of the text in parallel, creating a “multitext edition of Homer, one that 
would be expected not only to report variant readings but also to relate 
them as possible to different periods of history”.277 The weakness of this 
position is, in my opinion, the unproven equivalence of the aedic phase 
with the rhapsodic and then the Ptolemaic stage of the Homeric tradition — 
apart from a not entirely convincing and forced analogy between the 
Homeric tradition and that of the New Testament. Not all the variant 
readings have a certain tradition going back to the classical age as the 
‘Zenodotean’ (and Aeschylean) doita in //. I 5.278 Ptolemaic papyri seem 
rather to hand down rhapsodic variants,279 such as those transmitted from 
the medieval text of the Homeric Hymns. An informative example of the 
erroneous Bird’s approach is his discussion2s0 concerning the “minor 
textual variants” attested for /7. VI 287-8 in P.Sorb. inv. 2302 (TM 61240 = 
M-P3 786.1 = LDAB 2380 = Allen-Sutton-West p480a): in his opinion, 
adodiccay Katé and OdAapoy KateBycato, rightly defined at first as “clear 


examples of spelling reflecting pronunciation’,281 “convey the memory of a 
live performance, with all its speed and dramatic intensity”; this spelling, 
continues Bird, “presumably would not happen if the lines were being 
dictated slowly and carefully”. To affirm this, the phenomenon should be 
limited to the Homeric papyri, but it is notoriously widespread in texts of 
various nature, literary as well as documentary. A similar misinterpretation 
concerns the spelling t6p pa for tov pa in J/. XVII 578, attested in P.Sorb. 
inv. 2303 (TM 61117 = M-P3 948.2 = LDAB 2255 = Allen-Sutton-West 
p501c).282 

Multitextuality has been also applied by DUE — EBBOTT 2010 to the 
tenth book of the //iad:283 the two scholars, in order to avoid presenting a 
critical text “that obscures the multiformity of the oral tradition”, choose to 
print four witnesses that represent the state of the text in different historical 
periods (a papyrus of the II century BC, one of the III AD, one of the VI 
AD, and the Venetus A). The ‘critical’ text is followed by a commentary, 
which, although not inclusive of all the information of a critical apparatus, 
in the author’s intentions makes it possible to better explore the differences 
between the witnesses as it is more focused on the chosen texts. In fact, this 
edition is the paper version of the digital project, and, as far as the paper 
edition is concerned, there are many reasons for perplexity: the main reason 
is that, although the authors wish ideally to reproduce any text that presents 
significant variants, they are forced, by necessity of space and time, to 
make a choice regarding the number of witnesses to be presented, and the 
difficulty increases due to the deliberate omission of the critical apparatus. 
The specimina printed in the edition are certainly analysed in detail in the 
commentary and are cited together with other relevant witnesses, but, as 
admitted by the editors themselves, without reaching the level of 
completeness and systematicity that is instead of a critical apparatus. 
Moreover, such an edition is certainly no longer easily consultable with 
respect to a ‘traditional’ edition, above all because of the inconvenience of 
having to resort each time to the comment, separate from the text. The 
other disadvantages are: no translation is available, if not a few hints in the 
commentary (not even for the papyrus texts), a limited number of 
witnesses, no supplements for the papyrus’ lacunae. Therefore, for this 
work the definition of ‘critical edition’ is not appropriate, given that the 
Dué and Ebbott reject the traditional operation, as above anticipated (“we 
want to avoid presenting a critical text that obscures the multiformity of the 
oral tradition”). They believe that it is not the apparatus but the 
commentary that makes the edition ‘critical’, “but critical in a different way 
from what is usually indicated by the term”.284 As regards their definition 


of ‘textual criticism’, they consider that it is authentically exercised by not 
judging which text is right or wrong, but rather “to criticize what these 
texts contain in terms of the textual tradition and the oral tradition that 
preceded it”; in other words, one can only distinguish what is a trivial 
scribal error from a genuine oral variation.285 This unusual way of 
conceiving the critical edition, however, is the basis of an editorial product 
that, in addition to being extremely uncomfortable in the consultation, in 
my opinion fails precisely in its primary purpose, which is to allow 
‘independent use’ by the reader of these versions. The editors justify their 
unconventional editorial choice — which in fact is a ‘non-editorial’ choice, 
as they suspend any judgment on the value of the variant readings — 
claiming to join “to the goals of the Homer Multitext” Project, that is the 
realization of a digital scholarly edition of Homer. 

The editors did not hesitate to describe the project as a revolution. The 
Homer Multitext Project stems from the belief that it cannot in any way 
apply the traditional editing system to works such as the /liad and the 
Odyssey, because they originated from a long oral tradition without the aid 
of writing: 


in such a tradition in which the composition is occurring in the course of 
performance, there is no one “author of the original composition” to try to 
recover, for there is not only one composition, but also no other author. 


It is the oral nature of the so-called Homeric poems that renders traditional 
editorial methods ineffective: the textual criticism that must be applied to 
them should not move in the direction of a ‘paradigmatic’ (or ‘canonic’) 
critical text, but should maintain “the integrity of each witness to allow for 
continual and dynamic comparison”.286 According to the editors’ view, 
another traditional concept that must be abandoned is also that of “variant 
(reading)”, because it implies a judgment on the quality of the variants 
presented by the manuscripts: we should therefore adopt the more neutral 
term “multiform (reading)”. The logical consequence of this statement is 
that, if each variant reading is potentially ‘authentic’, the traditional critical 
apparatus no longer has any reason to exist: an approach of this kind 
“deliberately puts some central concepts and issues of conventional textual 
scholarship in crisis. The basic text, the text, the textual apparatus, and the 
variant”.287 Dué and Ebbott state with very explicit and categorical words: 
“the digital Multitext must be fundamentally different from these print 
editions in conception, structure and interface”.28s One point that is very 
relevant in the Project is the need to present the digitized text of all the 
witnesses but without the critical apparatus. Despite the conspicuous 


attention attracted by the Homer Multitext Project, the impression is that, 
beneath the statements of principle, there is currently very little concrete to 
compare with. In other words, there is a contrast between the limpidity and 
the firm certainty with which the Project guidelines have been presented 
and the indeterminacy with which practical questions are dealt with: e.g., 
how the texts of the manuscripts will be presented in the absence of a text- 
reference base? Which tool will be used to replace the traditional critical 
apparatus? In the section dedicated to the description of the project on the 
website of the Homer Multitext Project it still remains said that “unlike 
printed editions [...] the Homer Multitext offers the tools for reconstructing 
a variety of times and places”, without specifying what tools are involved 
and how they work. Moreover, the absence of precise guidelines on the 
structure of the Project had been admitted also by Dué and Ebbott,289 who 
however had set aside the issue, considering it a detail (“but no matter what 
the details end up being, we have committed to three foundational 
principles: collaboration, open access, and interoperability’”’).290 It must 
have been this vagueness in structural terms, together with a certain 
insistence on the importance of leaving the final choice to the reader, to 
have aroused the lucid judgment of M.L. West: “Nagy seems to think that 
an editor should simply marshal the evidence in a non-committal way”. 
While Nagy seems to assume that the Homeric editor’s most important task 
is to suspend his own judgment to prevent it from undermining his reader’s, 
West defends the right to actively exercise his own, concluding that they 
evidently have “very different concepts of the editor’s role’”.291 In Dué’s 
and Ebbott’s edition, the text of the 10th book of the Jliad becomes 
therefore four different Iliadic texts, three ancient and partial,292 the fourth, 
medieval and complete (Venetus A), each witness of the Homeric perennial 
multitextuality. Despite the absolute peculiarity of the Homeric tradition, a 
methodology of this type leads not only to the renunciation of the critical 
text, as said, but also to the renunciation of a critical approach tout court: it 
is revealing the treatment of the variant (or better, multiform) readings, 
each of which seems a priori to testify to an aedic (re)composition-in- 
performance, even though the editors tend to overlook their possible, often 
probable un-aedic origin (rhapsodic variants, glosses, conjectures).293 


5Conclusions 


As in many other scientific fields, the implementation of IT systems can 


lead to a methodological renewal only after a dissemination of their 
scientific use and only after having produced results superior to those 
obtained by past philologists with the traditional methods. This scenario 
could be achieved through better quantitative and qualitative information 
processing, especially with reference to the collation of the manuscripts and 
to the stemmatic contamination. The availability of IT tools will 
increasingly allow to broaden and better evaluate the data of the manuscript 
tradition and could avoid an excessive limitation of the witnesses only for 
the need to reduce their number. The greater ‘capacity’ of the computer 
systems certainly allows a simpler and more efficient collection of the 
variant readings, sometimes useless for the constitution of the text but often 
valuable to study the textual tradition in a wider cultural sense. Regardless 
of the case of the Homer Multitext Project, there is however the sensation 
of an overestimation of those variant readings not really significant for the 
constitution of the text, overestimation which is equally pernicious 
compared to their scarce eight-twentieth-century consideration. Sometimes, 
this ‘hypervalutation’ seems to be promoted by some digital editors after 
having observed the limited or null ecdotic progress of the scholarly work. 
There are undeniable advantages in the digital scholarly editions, that have 
been clearly illustrated, for example, by Mastronarde (see above), but it is 
also true, and obvious, that the ecdotic improvement derived from a more 
careful study of the paradosis is not necessarily produced by computer 
tools.294 There will certainly be a moment when the critical editor will be 
supported or even replaced by the AI, but the task of choosing the best 
possible text in a methodologically correct manner cannot but remain the 
essential purpose of the philological activity for most of the manuscript 
traditions handed down by a plurality of manuscripts. 
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2: Standard layer of the sentence [1] in Arethusa. 


<sentence document_t tps:77sematta.hum. helsinki. fi/edt id= Span="" subdoc=""> 
<word form="SU" head="" id="L" Lemma="" postag="" relation="" /> 
<word form="munaic"” head="@" id="2" lemma="nolew" postag="v2sfia---" relation="PRED" /> 
<word form="," head="4" id="3" Lemma="puncl” postag="u-------- “ relation="AuxX" /> 
<word form="&ekpe” head="2" id="4" Lenma="beAp~oc” postag="n-s---mv-" relation="ExD" /> 
<word form="," head="4" id="5" Lerma="puncl" postag="u-------- " relation="AuxX" /> 
<word form="2aSU" head="" id="6" Lenma="" postag="" relation="" /> 
<word form="SU" head="" id="7" Lenma="" postag="" relation="" /> 
<word form="" head="9" id="8" Lemma="6" postag="1-s---fn-" reLation="ATR" /> 
<word form="moprje" head="" id="9" Lemna="nopeia” postag="n-s---fn-" relation="" /> 
<word form="Tjj" head="11" id="10" Lemma="6" postag="1-s---fd-" relation="ATR" /> 
<word form="vuxti" head="" id="11" Lemma="VOE" postag="n-s---fd-" relation="" /> 
<word form="tTadtT)" head="11" id="12" Lemma="obt0¢" postag="p-s---fd-" relation="ATR" /> 
<word form="néywac” head="2" id="13" Lemma="néynw"” postag="Vv-sapamn-" relatton="ADV" /> 
<word form="pot" head="13" id="14" Lemmas"éyw" postag="pls---md-" relation="0BJ" /> 
<word form="tpia" head="16" id="15" Lemma="tpia" postag="n-p---na-" relation="ATR” /> 
<word form="Cedyn” head="13" id="16" Lemma="Cebyoc" postag="n-p---na-" relation="0BJ" /> 
<word form="pTwv" head="16" id="17" Lemmas"&pto¢"” postag="n-p---mg-" relation="ATR" /> 
<word form="éni" head="13" id="18" Lemma="Enei" postag="c-- " relation="AuxC” /> 
<word form="o0K" head="20" id="19" Lemma="00" postag="d-- “ relation="AuxZ" /> 
<word forms"éxo" head="22" id="20" Lemmas"éyw" postag="vispia---" relation="ADV_CO" /> 
<word form="Gptoug" head="20" id="21" Lemma="&ptoc" postag="n-p---ma-" relation="0BJ" /> 
<word form="kai" head="18" id="22" Lemma="Kai" postag="c " relation="COORD” /> 
<word form="6Tav” head="27" id="23" Lemna="6Tav" postag="c-------- " relation="AuxC" /> 
<word form="€A6n" head="23" id="24" Lemma="Epxopau" postag="v3sasa---" relation="ADV" /> 
<word form="h" head="26" id="25" Lemna="6" postag="1-s---fn-" relation="ATR" /> 
<word form="nopya" head="24" id="26" Lemma="nopeia" postag="n-s---fn-" relation="SBJ" /> 


<word form="gu" head="27" id="Z8" lenma="00" postag="p-s---mn-" relation="0BJ" /> 
<word form="a0Té&" head="27" id="29" Lemma="alté¢" postag="p-p---na-" relation="0BJ" /> 


<word form="néppw" head="22" id="27" Lemma="népmw" postag="visfia---" relation="ADV_CO” /> 


<word form="." head="@" id="30" Lemna="puncl" postag="u-------- " relation="AuxK" /> 
</sentence> 
3: XML of the treebanked original layer of the sentence [1]. 
® ovclaud.2.243 7 (> 101 @<200 @ Tiberiane Orh2 
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4: Screenshot of the main view from Sematia, when the document O.Claud. 
I] 245 is expanded (but O.Claud. II 243 and 246 are not). On the right, the 
metadata inserted in Sematia by the annotator is visible. The field “Same 


hand” is extensive with many documents also written in Petenephotes’s 


hand. The editor mentions that the writer is Petenephotes himself,311 thus 


he is both the author and the writer. Clicking from the blue “original” or 


green “standard” buttons would take you to the text, and clicking the paper 


icon next to those buttons, you could view the treebank XML. 
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F 
5: A screenshot of the search and results in Sematia for the dative in the 
original layer vs. the genitive in the standard layer. 
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6: A screenshot of the search in Sematia for a singular verb in the original i 
layer vs. a plural verb in the standard layer. 
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7: A screenshot of the search results in Sematia for the word form “ahabe 
In the “word” column, the words in green come from the “original” layers and 
the words in black come from the “standard” layers. In O.Claud. volume I, 
the form was not standardised according to the classical norm, whereas in 
volume II it was (with one exception). 


Linguistic Annotation of the Digital 
Papyrological Corpus: Sematia 


1Introduction: Why to annotate papyri 
linguistically? 


Linguists who study historical languages usually find the methods of 
corpus linguistics exceptionally helpful. When the intuitions of native 
speakers are lacking, as is the case for historical languages, the corpora 
provide researchers with materials that replaces the intuitions on which the 
researchers of modern languages can rely. Using large corpora and 
computers to count and retrieve information also provides empirical back- 
up from actual language usage. In the case of ancient Greek, the corpus of 
literary texts (e.g. Thesaurus Linguae Graecae or the Greek and Roman 
Collection in the Perseus Digital Library) gives information on the Greek 
language as it was used in lyric poetry, epic, drama, and prose writing; all 
these literary genres had some artistic aims and therefore do not always 
describe language as it was used in normal communication. Ancient written 
texts rarely reflect the everyday language use, let alone speech. However, 


the corpus of documentary papyri gets close. The writers of the papyri vary 
between professionally trained scribes and some individuals who had only 
rudimentary writing skills. The text types also vary from official decrees 
and orders to small notes and receipts. What they have in common, though, 
is that they have been written for a specific, current need instead of trying 
to impress a specific audience. Documentary papyri represent everyday 
texts, utilitarian prose,295 and in that respect, they provide us a very 
valuable source of language actually used by common people in everyday 
circumstances. 

This significant text corpus is openly available to us in digital form. 
The Papyrological Navigator (PN)296 hosts the Duke Databank of 
Documentary Papyri and provides a search engine as well. However, any 
deeper linguistic research cannot be performed. The search engine at PN is 
mainly designed for the needs of historians and editors of papyrus texts in 
locating parallels and sources using word-string searches. In order to utilize 
the text corpus linguistically, it needs to be enriched with linguistic 
information, i.e. it needs to be linguistically annotated.297 Linguistic 
annotation can concern many different levels of language, usually 
morphology, syntax, semantics, or pragmatics. Even the basic 
morphological annotation alone can provide for complex linguistic queries. 
The literary Greek corpus has recently been automatically lemmatized and 
morphologically parsed.29s Greek that is found in papyri deserves to be 
similarly treated so that the literary language can be compared with the 
utilitarian prose found in papyri, enabling our views on_ historical 
developments and variation of Greek language to be as full as they can be. 

In this paper, I will discuss the criteria and approach which I have 
chosen while planning the Sematia corpus and platform.299 While this is an 
ongoing process and plans often are subject to change, it is still worthwhile 
to explain what lies behind the selected approach, what the future plans are 
and possible new directions and, finally, what can be achieved with all this 
work. 


2Corpus design 


One key factor in corpus design generally is that the corpus is 
representative. Whether we want a holistic or strictly selected corpus, 
depends on the research questions for which the corpus is meant to provide 
answers. If we want answers from a certain domain of texts (e.g. private 


letters), we select only those texts into the corpus. Similarly, whether we 
want a synchronic or diachronic corpus depends on whether we want to 
examine changes in language used within a certain time span or not. In 
historical linguistics, corpora are generally diachronic. 

The papyrological corpus in PN is a growing and a changing one. It 
includes all published documentary papyri, and the Greek material ranges 
approximately from the IV century BC to the IX century AD. Newly 
published texts are added into the database by the academic community of 
papyrologists via the online Papyrological Editor (PE), where a board 
checks and votes on the submissions.300 Also, mistakes (typos or wrong 
readings etc.) in the texts that already exist in the corpus, can be corrected 
via the same Editor. This is one reason for the idea that Sematia should also 
be kept open-ended, so that ideally it could include the whole corpus, 
which represents the Greek used in documentary papyri for a period of 
about a thousand years. Thus, at the moment, the corpus design is a loose 
one, but users (both the annotators and the researchers who only wish to 
perform queries) can decide on a case-by-case basis what they want to 
annotate or include in their searches. Once a version of a text has been 
annotated, that annotation is stable, but if the system alarms us that there 
has been a change introduced into the base text in the PN, the annotator (or 
someone else, for that matter) can renew the annotation on that text, if it 
seems warranted.301 The process of getting texts annotated is slow at the 
moment, since it is performed semi-automatically (more on this aspect 
below). The choice of texts to be annotated is not authoritatively dictated 
by us; the choice is made by the users, so anyone wanting to have a 
specifically chosen set of material, can proceed in annotating the papyri. 
This way s/he also makes a contribution towards the annotation of the 
whole corpus. And when there are more texts already annotated, each 
researcher may select his/her own subcorpus and perform queries only on 
them (either in the Sematia platform or after downloading all the selected 
annotations for external use). The latter option makes the research more 
easily replicable (a basic requirement in corpus linguistic research). 

Corpus design also includes deciding over the level of annotation and 
what features are annotated and how. At the moment, our basic approach is 
to include the morphological and syntactic annotation in the form of 
dependency treebanks. We follow the Ancient Greek and Latin Dependency 
Treebank system.302 Sematia is designed to provide a ‘basic’ level of 
annotation, because we have this holistic idea of the whole corpus 
eventually being annotated; the research questions must not in this case be 
strictly decided beforehand. However, since the automatic morphological 


parsing has been performed on literary texts as mentioned above, this is a 
logical next step for the whole papyrological text corpus as well. This, in 
turn, would make the manual syntactic treebanking somewhat quicker, as 
the morphological forms would be more accurate than they are now to 
begin with (on the process of annotation, more detailed description below). 


3How to annotate papyri? 


Why should we devote a section on how to linguistically annotate papyrus 
texts? Because the papyri represent ancient textual material often preserved 
in a fragmentary condition. The organic writing material has suffered 
damage of many kinds. But, due to the importance of papyri as a source, 
papyrologists work very hard on reading, transcribing, and reconstructing 
them, i.e. editing the text, so that other researchers can also use that source. 
Still, many gaps and question marks can remain in the editions. All this is 
encoded within the text in the digital edition, in TEI EpiDoc XML, 303 and 
for this reason we do not have simple access to the raw text that could 
simply be uploaded for some linguistic annotation tool. In fact, the editorial 
work gives us plenty of material that we can and should also use in the 
linguistically annotated corpus. Therefore, we need to preprocess the texts 
available in the PN in a certain way. 


3.1Preprocessing 


The Sematia tool was first developed mainly for the above-mentioned 
preprocessing need. It creates two parallel layers of the same text; one 
being a sort of diplomatic edition (called “original”), and the other 
including the editorial suggestions (called “standard”). The tool has already 
been described in another article,304 thus I will not present the details here. 
What makes the Sematia corpus special, is that both of these parallel layers 
are linguistically annotated. This way it is possible to study only the 
version that has truly been preserved for us (the original layer), or to 
compare the actual preserved text with its standardized version. The 
differences in this comparison can be turned into a third layer (called 
“variation”), which I will briefly discuss later. 


3.2Annotation 


In order for this corpus to be beneficial for all Greek linguists, I decided 
that we should follow the same scheme and standard used in the corpora of 
ancient Greek. This means the Ancient Greek and Latin Dependency 
Treebank that includes Greek literature. In addition, the PROIEL treebank 
(New Testament and some Greek prose) follows the Dependency 
Grammar.305 In the annotation of papyri, we follow the Guidelines of 
AGDT.306 At the moment, we use the external annotation environment, 
Arethusa, provided by the Perseids Platform,307 with which we have an 
API integration in Sematia. This means that a text can be exported directly 
from Sematia to Perseids and Arethusa, and after it has been annotated, a 
member of the Sematia board (at the moment the project director) goes 
through the annotations in Perseids and either accepts or returns them to 
the annotator to be corrected. After the approval, the treebanks are 
committed back to Sematia (both the GitHub repository and the online 
site). 

The process of annotation in Arethusa includes the tokenization; i.e. 
tokenization is done when the plain text is imported into Arethusa, not into 
Sematia. The text receives an automatic lemmatization and morphological 
tagging (by Morpheus). But all the lemmas and morphological tags need to 
be checked and corrected by the human annotator; there are several forms 
and lemmas in the papyri, which Morpheus, being designed for classical 
Greek, does not recognize, for example the Egyptian names. Moreover, 
Morpheus does not do well in selecting the correct form from several 
homonyms. The syntactic annotation and dependencies have to be 
performed manually by the annotator. In other words, using Arethusa is 
convenient up to a point; but it is also quite laborious and thus expensive as 
it needs human resources: skilled annotators and their time. Nevertheless, 
in the end, we do get accurate annotations that can most likely be used in 
training automatic syntactic and morphological parsers in the future. 

The process can be presented by an example with images. Our sample 
sentence is the second sentence of a letter from Petenephotes to Valerius, 
written on a potsherd in the garrison of Mons Claudianus in the Eastern 
Desert (O.Claud. I 245,2—7; mid II century AD): 


[1] [kaAdc] |3 nvtjotc, Gere, sa[v EXON] |4 7 mopreE TH voKTI TADTH \ TELWWOs LOL 
'|5 toia Cebyn Gpt@v éxi od €|°yo Gptovs Kai Stav éAOH 1 mOl"pya TéLYOo ov 
avTd. 


3. 1. noujosic 4. /. nopsia 5—6. 1. élyo 6—7. 1. mol|psia 7. 1. cot 


Please, brother, if the caravan arrives tonight, send me three pairs of bread as I 
do not have any bread and when the caravan arrives I shall send them to you. 


Note that the apparatus has several corrections (standardizations), but not 
for the Ve confusion in the conjunction ézi (I. émei), 1. 5. This is the 
standard practice in this edition. Other so-called orthographic mistakes are 
usually standardized in the apparatus, but not the most common one 
between 1/ 1, because the editors apparently consider this such a common, 
parallel variant that it can no longer be considered as a ‘mistake’ (see also 
the chapter by J. Stolk in this volume for problems that this type of fluidity 
between editorial corrections can cause). 

The standard and the original layers of this sentence in the Arethusa 
treebank tool are presented in Figg. | and 2. Only the syntactic trees can be 
seen in the screenshots, and only one lemma/morphological analysis (that 
of the highlighted word), in this case the conjunction mentioned above. 
This is emphasized here, because an automatic parser would automatically 
take this word as the preposition ézi, but when the human annotator checks 
the sentence, s/he notices that the preposition is not the correct 
interpretation, and can make the necessary correction, even though the 
word is not editorially corrected in the original electronic source of ours, in 
the PN. 

The differences between the layers are apparent in the images; the 
supplied text, for example, is not annotated in the original layer, it is 
represented with a dummy marker SU so that the annotator notices that 
something is missing there and the supplemented word does not end up in 
the corpus of original layers. This also leaves some of the branches of the 
sentence tree hanging in the air, as some words that would be the heads on 
which other words depend on, are not preserved in the papyrus. The non- 
standard orthography in the original will not prevent the annotator from 
recognizing and marking correct lemmata for the forms, thus lemma 
searches will find all variant spellings of the words from the original layers. 
The underlying XML forms show us what the whole annotation entails. 
Fig. 3 presents the XML of the original layer’s annotation of the same 
sentence [1]. 

We can see that the annotation includes the existing form of the word in the 
sentence, its head, its lemma, the postag and the syntactic relation of the 
word in the sentence. The postag includes the whole morphological 
analysis: part-of-speech, person, number, tense, mood, voice, gender, case 
and degree. For example, the form mzepwac (word id 13) is a verb, singular, 
aorist participle active in the masculine nominative. The postag gives the 


very basic morphological analysis, and we could occasionally hope for 
something more specific, such as distinguishing proper nouns from 
common nouns or possessive pronouns from other pronouns, but as of this 
moment, Morpheus gives us these. In the future, other automatic parsers 
might take these distinctions into account more easily. However, even this 
morphological analysis enables us to search complex linguistic structures, 
especially when combined with the lemma and syntactic annotations. This, 
I think, is sufficient to fulfill the need of basic linguistic annotation for the 
Sematia corpus. Other levels of annotation, e.g. semantic or information 
structure annotations, would take considerably more time and effort. 


4Metadata and its purposes 


The date and place of origin of each text are vital when we wish to see in 
which time periods and in which areas certain linguistic features appear. 
They are generally provided in the papyrus editions and presented also in 
the PN metadata field, from whence they are automatically drawn into 
Sematia. 

As mentioned already in VIERROS — HENRIKSSON 2017, we add some 
metadata, which is not available in PN, namely aspects relating to the 
handwriting and the writers vs. authors. Some changes have been planned 
for these metadata fields and they will be implemented in the near future. 
The purpose is to identify text parts written in the same hand. When 
imported to Sematia, each document is divided into ‘acts of writing’ by the 
element <handShift>, i.e. each section written by a different writer receives 
its own layers and treebanks. Since there are often papyrus archives in 
which the same hand can have written several documents, it is important to 
link these acts of writing together, so that we can also try to study idiolects 
and compare certain writers to others. At the moment of writing, we can 
add metadata concerning the handwriting30s and concerning the writer, the 
author and the addressee.309 See Fig. 4 for an example on the metadata in 
O.Claud. H 245 (which only has one hand). In many cases, however, the 
name of the actual writer is not known, e.g. in private letters the sender of 
the letter is taken as the author, but the actual writer is not necessarily the 
same person as the author, nor is he named. In contracts, the names of the 
contracting parties are mentioned, but the scribe who draws up the text or 
who pens down the letters onto the papyrus often remains unnamed. 
Therefore, the 7rismegistos People ID cannot be used in identifying the 


hands, since we have so many hands without names to connect them with. 
Our intention is to give each hand an ID of its own. The hands that have 
been identified to come from one writer (sometimes a very difficult task), 
can be connected to the same ID. The hand-ID will make the current 
metadata field “Same hand” obsolete.310 

For the purposes of studying linguistic register and features typical of 
certain text types, we have also included the fields in which we can insert 
metadata on the text type and the addressee. 


5Sample results, i.e. what queries can find 


The treebank XML files (including the metadata) in Sematia can be 
exported for querying in external treebank query tools.312 It is possible to 
export the treebanks of all layers together, or choose the original or 
standard layers separately. I will not go through all the possibilities the 
external search engines can give for linguists;313 I will describe some 
sample searches that can be performed on the Sematia site itself.314 There, 
too, it is possible to search only from the treebanks of the original layers or 
only from the standard layers, but one of the essential features is the 
possibility to find instances where the original and standard layers differ. 
This is where we can get more deeply into linguistic variation. For 
example, it is very simple to search for instances where one grammatical 
case is used when editors have thought that a different case would have 
been more understandable, or more standard (what the editorial 
standardizations might have meant in different times when papyri have 
been edited, see the chapter by J. Stolk in this volume). The search fields in 
Sematia employ Regular Expressions (regex). The searches can naturally 
be limited in multiple ways, either by metadata fields or by the other field 
related to linguistic annotation, e.g. searching only objects or subjects, or 
only verbs or pronouns. More complex searches combining several words 
or forms would need to be made externally. 

An example search concerning the grammatical case, the dative instead 
of the genitive, is presented in Fig. 5. Since the postag holds the case in the 
8th place of the string, we can use the values for dative (d) in the original 
layer’s postag field and genitive (g) in the 8 place in the standard layer’s 
field, and let other places of the string be whatever else by using the 
wildcard (.); the beginning of the string is marked by (“). The values (d) 
and (g) can have different meanings in other positions in the postag, thus it 


is good to define the exact location. In other words, when using the search, 
it is vital to know how the annotations have been made, i.e. what each 
symbol means e.g. in the postag field. The guidelines of annotation need to 
be known and understood. 

The search gives eight results with the limited data we have in Sematia at 
the moment (2017, ca. 100 papyri). The result list can be ordered according 
to different fields, in Fig. 5 it is ordered by the document name. We can see 
that some of the instances may, in fact, signal orthographic confusion based 
on phonological variation rather than case confusion (e.g. Neyovutat / 
Neyovtov),315 but some of the instances more clearly tell that the writer 
has, for some reason, really chosen the dative rather than the expected 
genitive (e.g. Mapovati / Mapovatoc). Similarly, we could bring up e.g. all 
prepositions in the texts by simple postag query (“r), or see where singular 
verb forms appear instead of plural verb form (“v.s vs. “v.p). In the latter 
search, the results again point to the interplay of phonological factors 
confusing the morphological interpretations. See Fig. 6, where two out of 
three of the singular vs. plural verb form are forms consisting of graphemes 
at / s, both marking the phoneme /e/ at this time, and the third one has a / € 
confusion, which was also perhaps due to weak pronunciation of the 
unstressed vowel. These results give us material for further research on 
phonology playing a part in the morphological mergers in Greek, and the 
impact of education in writers’ ability or inability to use standard 
orthography in such occasions, but they also provide us with material for 
enhancing our tools in the future. 


6Future plans 


A variation layer has been on our agenda since the beginning and it was 
discussed already in the previous article to some extent.316 With the above 
described method of comparison between the original and standard layers, 
we can only find variation (instances where there really are differences 
between the layers), when there is a regularization in the PN, or when the 
annotator has marked these differences in the treebank XML after seeing a 
difference not available in the PN version. These comparisons and 
differences are planned to be automatically retrieved into Sematia to form 
the basis for the variation layer. In addition, we do need a way to manually 
encode other types of linguistic variation in this layer for several reasons. 
For example, there is a need to further specify certain differences as more 


phonological or more morphological in nature. Secondly, some variation is 
impossible to detect from the annotations when the postag does not really 
describe what we have in the text. I will give an example of this type of 
case with one sentence from a letter written by Ammonius to Apollonius 
(O.Claud. I 155,3—5; I century AD): 


[2] Apzarotos 6 KiPapiatys eil4pnké por dt émtotoAny éAa|5Ba x0 Tic 
YOVAUKOG LOD. 


Harpaesius, the cibariator, has told me that I have got a letter from my wife. 


The form éaBa, “I got”, has not been corrected in the apparatus, even 
though it represents mixed morphology; the aorist of the verb AauBava 
would be édafov according to the classical standard (the second i.e. 
‘strong’ aorist), but in the Koine the athematic endings of the first i.e. 
‘weak’ aorist (-a for the first person) were occasionally used (and they are 
the ones used in modern Greek).317 In the Mons Claudianus ostraka so far 
annotated in Sematia, there are nine attestations of the form éAafa (plus 
three times written as aiAaBa),318 but the editor has fluctuated in correcting 
it in the apparatus (see Fig. 7). We can find this word by using the word 
search, but as can be seen from the postag, it is not possible to indicate this 
type of variation there; the postag is the same in both éAaBa and éAaPov: 
first person singular aorist form. It would be very convenient to mark this 
up in the separate variation layer as mixed morphological endings in the 
aorist. 

We will be developing Sematia and similar tools further.319 One idea is to 
have the whole papyrological corpus already present in Sematia, and 
updated in set intervals, i.e. there would no longer be the need to import 
texts individually. Phonological searches will be enabled on the whole 
corpus. We also aim at developing an automatic morphological parser for 
Greek found in papyri, with more accurate analysis than what Morpheus 
currently has. 
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1Introduction 


Linguistic variation in documentary papyri has been noticed by editors 
since the early days of Papyrology. Some editors make occasional 
comments about variant spellings320, others decide not to mention them at 
all. Kenyon explains his reasons for refraining from marking variation in 
the introduction to P.Lond. I: 


It is not to be supposed that any human transcript can be entirely free from 
errors; but the palpable blunders in spelling and grammar with which the papyri 
abound may be credited in the first instance to the original scribes. It has not 
been thought worth while to disfigure the pages by appending the warning sic to 
each such violation of conventional rules321. 


In BGU I (1892-1895), the first “truly papyrological edition” according to 
Van Minnen,322 the editors added to some transcribed words, such as 
BiBAstdtov, a note in the critical apparatus saying “/. BiBAtSiov” (BGU I 2, 
n. to |. 17).323 The method of the ‘Berlin editors’ is followed by Grenfell 
and Hunt in their editions published in P.Grenf. II (see p. xii) and P.Oxy. I. 
They also briefly explain where they consider such a note to be required: 


Faults of orthography are corrected in the critical notes wherever they seemed 
likely to cause any difficulty.324 


In 1931, this by then customary practice of regularization was included in 
the ‘Leiden conventions’ during the International Congress of Orientalists 
in Leiden (7-12 September 1931). One would expect that the decision 
about a unified system of critical signs would be followed by a discussion 


on how to use them. Whereas several scholars have indeed commented 
upon the precise meaning and use of some of the signs, such as the 
underdot, little explanation has been provided about the practice to 
regularize the Greek language in papyrus documents.325 Herbert Youtie 
describes the process as follows: 


Immediately after the text the papyrologist puts a critical apparatus in which he 
gives conventional equivalents for vulgar or mistaken spellings.326 


This leaves the most important questions unaddressed, such as ‘to which 
forms should one apply this procedure?’ and ‘what is a conventional 
equivalent?’ 

Regularization implies a norm from which the attested variant deviates. 
This norm is generally not explicitly formulated in editions and rarely 
discussed in secondary literature. This makes one wonder whether editors 
always use the same norms. Whereas the early papyrus editions had to cope 
with readers that were unfamiliar with the Koine Greek language, advances 
in Greek linguistics and the large corpus of papyrus editions published to 
date have made most modern readers more accustomed to the features of 
Koine Greek. May this have changed editorial practices? The digitization of 
papyrus editions in the Duke Databank of Documentary Papyri (DDbDP) 
required a level of standardization across all editions. How did the 
digitization process influence the consistency of traditional methods? These 
editorial practices have not been studied before, while they form the basis 
for our modern tools and digital editions. In order to develop new tools and 
new methods for digital editing, I consider it important to examine how the 
current ones are functioning and how we can use existing methods to 
improve digital technology. 

In this paper I analyse the results of a system of editorial regularization 
which has been in practice for 125 years. The study of editorial practices in 
the past and present is executed by means of the new Trismegistos Text 
Irregularities tool. This tool collects all editorial interventions that are 
annotated in the Papyrological Navigator (http://www.papyri.info) and 
allows for detailed searches and analyses of the attestations.327 I will first 
give a short overview of the parts of the Leiden conventions that are 
relevant for the regularization of language and their current application in 
the digital editions in the Papyrological Navigator (section 2). Then, I will 
discuss the past and current use of critical signs and regularizations in the 
critical apparatus in the original and digital editions (section 3). The 
possibilities for categorization of variation and different standards are 
examined in section 4, followed by a concluding section on how we may be 


able to combine the traditional and modern aims in the development of new 
digital tools (section 5). 


2The ‘Leiden system’ 


At the 18th International Congress of Orientalists in Leiden (7-12 
September 1931), the participants of the Papyrology section discussed the 
usage of critical signs in editions of inscriptions, papyri and literary 
authors. They decided on a unified set of conventions, later referred to as 
the ‘Leiden system’.328 As this was designed to be a universal system for 
editions of documentary and literary texts, it contained several elements 
which might seem redundant for editing documentary papyri. Two sets of 
brackets were chosen to represent scribal omissions and additions to the 
text, namely the angular brackets (...) for “lacunes” and “additions 
(lacunes comblées)” and the braces {...} for “interpolations”. Of course, 
interpolations that found their way into the original text through copied 
manuscripts are not commonly encountered in documentary material. 
Consequently, these two sets of brackets are in papyrological practice 
reinterpreted to represent straightforward editorial ‘additions’ and 
‘deletions’ of letters and words that were forgotten or added superfluously 
by the scribe of the document for various reasons. The remaining two 
categories of editorial intervention are “corruptions” and “corrections”. 
Both are indicated in the critical apparatus of documentary texts and are not 
distinguished formally in papyrus editions. Van Groningen added explicitly 
that corrections should never replace the text of the papyrus in the 
transcription (as done with literary texts).329 

The different types of editorial interventions are all represented in the 
EpiDoc schema used for marking up textual features in digital editions of 
inscriptions and papyri.330 Accordingly, the papyrological conventions used 
in the Duke Databank of Documentary Papyri include the angular brackets 
for “Characters erroneously omitted by the scribe, added by modern 
editor”, the braces for “Superfluous letters removed by the editor” as well 
as the option to put regularizations in the critical apparatus.331 The 
regularizations in the apparatus can be tagged in different ways in EpiDoc, 
namely as “Correction of erroneous characters” with the two alternatives 
marked by <corr> and <sic> and as “Regularization of dialect or late 
spellings, etc.” marked by <orig> and <reg>.332 Both are used in the 
collaborative online editing environment of the Papyrological Navigator, 


called the Papyrological Editor.333 This platform uses a non-XML 
representation of the EpiDoc schema, called ‘Leiden+’, in order to facilitate 
easy entry of new texts by its users.334 

All editorial conventions used in Leiden+ are explained to the user in a 
set of online guidelines.335 The Leiden+ Documentation tells the digital 
editor to distinguish between a “spelling correction” to be used for 
“correction of outright scribal error’ 336 and an “orthographic 
regularization” to be used for a “non-standard orthographic form’’.337 
According to the guidelines, critical signs should be used for spelling 
corrections as well, which reduces the practical difference between the four 

categories into two basic types. The PN is thus expected to encode 
1. ‘corrections’ by means of critical signs (for additions and omissions) 
and in the apparatus (for substitutions and more complex cases), and 

2. ‘regularizations’ of non-standard forms in the apparatus. 


3Editorial regularization in practice 


Although papyrologists have agreed on the methods to be used in papyrus 
editions, as described above, the application of these basic principles is not 
self-evident. Herbert Youtie already stated in his prolegomena to the textual 
criticism of documentary papyri: 


it is a far cry from subjective opinion to objective reality, although no hint of 
this difficulty is ever betrayed in the definition of the signs that we find in 
papyrological manuals.338 


While this may apply to all critical signs, it is especially true for the 
editorial regularizations of the language found in papyrus documents. I will 
illustrate this by some examples mentioned below.339 

Following the basic distinctions available in EpiDoc (see section 2), I 
will distinguish between the so-called ‘corrections’ indicated by means of 
critical signs and in the apparatus (section 3.1) and ‘regularizations’ in the 
apparatus (section 3.2). The starting point for this comparison is the 
database of TM Text Irregularities, which contains a collection of all 
editorial regularizations in papyrus editions in the PN.340 There are two 
stages to take into account: the regularization indicated in the editio 
princeps and the annotation in the digital edition in the PN. This method 
will allow me only to quantify the outcomes of the second stage of this 
process. It should be noted that the digital edition in the PN is not always a 


true replica of the original edition, as more regularizations have been added 
in an attempt to level out the differences in conventions between various 
(older) editions. Hence, for every example mentioned below, I will also 
compare the digital regularization with the one in the original edition in 
order to reflect on possible differences between the two stages of editing. 


3.1 Corrections and critical signs 


The EpiDoc schema offers the possibility to distinguish between 
corrections of scribal errors and orthographic regularizations (see section 
2). The application of a special ‘correction’ tag results in the addition of 
(corr) after the corrected form in the apparatus of the digital edition. In 
practice, it has never been in frequent use and some earlier instances have 
been automatically converted into regularizations. The remaining 140 
corrections might have slipped through the net at an earlier stage or may 
have been added later, as users are still confronted with guidelines 
mentioning this option.341 

A closer look at the instances that are encoded as correction at the 
moment reveals that a significant part of them does not seem to fit the 
definition of “outright scribal error’. Regularizations of interchanges 
resulting from phonological mergers, such as ic to sic in O.Claud. IV 723 
and Hapadioov to Hapadsioov, Agipa to AiBa and [a2]odd0@ to [az]od@om 
in SB XXVI 16796,10-11, 16, are regularly found among these corrections 
(50 times).342 Morphological regularizations are also common (41 times). 
For some of those, it is possible to see why the (digital) editor regarded 
them as scribal errors. For example, in BGU XVII 2682,6, 9-10, the scribe 
mechanically added the standard accusative object yopiov apumeAucdv, 
whereas in this particular construction ([O]HoAoy@ ... pepidav | Liav yopiov 
G@pteAKOv) the noun phrase should have been a genitive partitive to the 
object pepidav | ptov.343 In P.Gen. IV 192,10-11, the pronoun oot was 
inserted too early and ended up with the wrong verb: OuoAoy@ éyew oot Kai 
| ypewmotetv instead of OnoAoya Exe Kai | ypewotetv cot. Printed editions 
do not make a distinction between mechanical scribal errors and other 
regularizations, although they sometimes provide an explanation for the 
variation in the commentary (as was done for BGU XVII 2682, n. to 1. 10). 
Apart from those occasional comments, the interpretation of the distinction 
between regularization and scribal error depends largely on the person 
digitizing the edition. The phrase ody vavAaic Ké EKaTOOTIS was 
regularized as “/. vabAotc Kai Exatootaic” in the apparatus of P.Jena II 8,7, 


but vadAoic was entered into the PN as a correction, kai as regularization 
and ékatooti|s as regularization (probably mistakenly for éxatootaic). 
Obviously, the distinction between the two types of regularizations creates 
a great challenge for the digital editor, especially without a clear definition 
of ‘scribal error’ at hand. 

Besides the special correction tag, simple scribal errors can also be 
indicated with critical signs according to the guidelines (see section 2).344 
The angular brackets (for editorial additions) and braces (for editorial 
deletions) are in common use in both printed and digital editions. In TM 
Text Irregularities, we collected a total of 6,920 instances of the use of 
angular brackets and 3,063 attestations of braces in the digital editions in 
the PN. Both of them are primarily used for scribal omissions and additions 
of whole words, amounting to 66% and 80% of the instances of the angular 
brackets and braces respectively. This also forms the main distinction 
between the use of critical signs in the text and regularizations in the 
apparatus: the critical signs mark additions and deletions of whole words, 
while regularizations are almost exclusively limited to parts of words.345 
However, the critical signs are also used for single letters and parts of 
words and in this usage they often overlap with the regularizations. Around 
82% of the angular brackets and braces put around part of a word are in 
fact used to indicate interchanges at a phonological and/or morphological 
level, such as (¢) or {e}1 (160 times) and the addition and omission of final 
-¢ (237 times) and -v (198 times). If one would want to achieve a 
meaningful difference between the use of critical signs and regularizations 
in the apparatus, critical signs in the text should not be used for 
orthographic and morphological interchanges affecting only a single letter 
or part of a word.346 


3.2 Regularizations in the apparatus 


Regularizations are traditionally indicated with ‘/.’ for /ege “read” in the 
apparatus of an edition. They make up the majority of all instances of 
editorial linguistic intervention in papyrus documents (92 %), amounting to 
more than 120,000 instances in all digitized papyri. Most of the editorial 
regularizations concern orthographic variation caused by changes in the 
pronunciation of Koine Greek (70%). Another significant part of the 
regularizations affects the spelling and use of morphemes (26%), such as 
case and verb endings. I divide the variation at a morphological level into 
two types: 


1. morphological interchange between different declensions or 
conjugations, such as the variation between an accusative singular in 
-a and -av for consonants stems or between the sigmatic and root 
aorist inflection of certain verbs,347 and 

2. morphosyntactic variation between the use of morphemes in a 
particular syntactic context, such as between a genitive or a dative 
case to express the recipient of a verb of giving or between an 
indicative or subjunctive following the conjunction iva. 


Both types occur among the regularizations in the apparatus. In some cases, 
morphological or morphosyntactic variation may be related to phonological 
merger as well. An example of this is the frequent interchange of 0 and a, 
of which one third of the instances are found in case endings (e.g. Tov / 
tv) and two thirds in other positions (e.g. @kt@ / OKtTa). It is, therefore, 
not always easy to distinguish different types of variation based on the level 
of language organization that they apply to. 

Almost 40% of the regularizations of orthographic variation concern 
the interchange of 1 and e.. For most of these variant spellings, 
regularization is not strictly necessary in order to understand the meaning 
of the word. Still, there are many forms for which one spelling has been 
regularized consistently in (almost) all instances throughout the corpus, 
such as ikoot to sikoot, éyIc to éyetc, éABiv to EADEiv etc. This is partly due 
to the addition of regularizations during the digitization process. For 
example, in the ed.pr. of P. Oxy. XLUI 3117 interchanges between 1 and «1 
are only regularized when they could be confusing (e.g. ézi to /. ézei in Il. 6 
and 14), but many others have been added in the digital edition, such as to 
BiBAsia in 1. 4, koweviv in 1. 5 and dxoKkpsivacGat in |. 6. The few instances 
where regularization in the PN is lacking may be caused by human error, 
such as the typo ‘mdAew for mde’ rather than dA in the digital edition 
of in POxy. XLII 3117,13—14 (http://papyri.info/ddbdp/p.oxy;43;3117), or 
the regularization of mepei to mepi in the ed.pr. of SB XX 14990,15,348 
which seems to have been overlooked in the Sammelbuch and the digital 
edition. 

There are also words for which the standard spelling may be more 
difficult to establish. According to classical rules, the suffix of the derived 
noun baepevere “excellency” is spelled with ¢1.349 This is also the spelling 
which is found in the majority of the VI- and VIl-century papyri, such as 
the attestations in P.Oxy. I 135-138 published in 1898. The alternative 
spelling bxepe@vua is not regularized in P.Oxy. I 144,4, nor in the five papyri 
P.Cair.Masp. I 67003, 67005-67008, published in 1911. Regularizations of 


bmepgvia do occur in editions that were published later, such as 
P.Ross.Georg. V 34,2 (published in 1935), CPR XXIV 27,17 (published in 
2002), and P.Oxy. LXX 4790,16, 19 and 30 (published in 2006). The 
alternative spelling in P.Oxy. I 144,4 became eventually regularized in the 
online edition. P.Cair.Masp. I 67003, 67005-67008 remain without 
regularization in their online editions.350 Remarkably, a regularization of 
the common form bzepoveia to basp@via was also added to the digital 
edition of P.Lond. HI 774—778.351 Whereas the earlier editions seem rather 
modest with regularizations of words that can be perfectly understood 
without, the growing need for consistency may have extended 
regularization to be applied to all ‘non-standard’ forms without agreement 
on the definition of ‘non-standard’. 

The Leiden conventions were designed to do reduce variation in 
editorial practices. The common format of a transcription with a critical 
apparatus containing regularizations becomes indeed the standard for all 
editions, but the variation in regularization practices continues in printed 
editions after 1931. The word BipAtlopvadxiov “archive” is spelled as such 
in 22 papyri and as BipAtlopvAd«Ketov in six papyri between the II and IV 
centuries AD. The spelling BipAiopvAdKewov is regularized to 
BiBALlo@vAdKtiov in the edition of P.Diog. 20, 6, and the online editions of 
SB VI 9625,23, PSI V 454,19, and P.Tebt. II 318,23; the normalized 
spelling is also found in the index of the last two editions. For the two 
editions that remain without regularization (P.Gen. [2 144,23; PHamb. I 
16,22), the spelling BipAto@vAakeiov was used both in the texts and indices 
of the original editions. Strikingly, the more common _ spelling 
BipAlopvAdKiov was even regularized to BipAlopvAdKetov in the first 
editions of P.Oxy. XXXIII 2665,17 and 19, PFam.Tebt. 15,iii,79, and 
P.Hamb. IV 244,12, and there are also editors that supplement the word in 
this spelling in abbreviations or lacunae (see BGU III, p. 2 to BGU I 
243,15, taken over in Chr.M. 216; Chr.M. 217,9, and P.Fam.Tebt. 29,44). 

Inconsistent regularizations, such as the ones mentioned above, often 
require careful analysis to determine whether this apparent lack of 
consistency can be justified in any way in each of the given situations and 
based on the material that the editors had at their disposal. Similarly, 
complicated situations arise when one attempts to regularize 
morphosyntactic variation. The phrase éév oov Th tTOxnt d0Ey “if it seems 
right to your fortune” occurs regularly in petitions from the I and III 
centuries AD. The second person singular pronoun is usually in the 
genitive case (cov), but it is also attested in the dative case (cou). The 
dative oot is regularized into a genitive cov in SB XXIV 15915,6, while 


SB XVIII 13732,13, regularizes the common genitive into the dative in this 
phrase.352 Confusion about the use of the dative or genitive case in these 
types of constructions is common among both scribes and editors and 
regularization is often far from straightforward.353 

Inconsistent regularizations are usually caused by a lack of agreement 
about the method of standardization. Differences between older editions 
have not always been levelled out during the digitization process and they 
might even have gotten worse in some of the more complicated examples 
mentioned above. Some editions take a more extreme approach than others 
when it comes to choosing a method for regularization. Common itacistic 
spellings, such as siva and ic, are often regularized in papyrus editions, but 
not in the editions of the Mons Claudianus ostraka. This is probably 
because these particular interchanges are very common in these ostraka 
and regularization seems unnecessary.354 This practice is not entirely 
consistent throughout the volumes (e.g. O.Claud. IV 723 and 839 
regularize ic, but O.Claud. IV 724 and 840 do not). Regularizations have 
been added during the digitization process in accordance with other 
papyrus editions, but the end result is still far from uniform (e.g. ig has been 
regularized in the digital editions of O.Claud. II 248 and 276, but not in 
O.Claud. II 363 and 383). 

Comparison between texts in the same volume and among other 
parallel texts is a common practice, but it is not the main method of 
regularization in most papyrus editions. The word vocokouetov “hospital” 
is attested in full in 14 papyri dated to the VI and VII centuries. Only one 
of those attestations is spelled with ¢ (SB I 4668,4), as is also common in 
modern Greek, while all the others write voookopiov.355 Based on 
comparison to contemporary documents, no regularization seems required 
for the other instances. In reality, regularizations to the standard spelling 
voooxoupesiov are found in original editions (e.g. P.Bodl. I 47,12, 20 and 26; 
CPR XXII 2,1, 5 and 9) and digital editions (e.g. P-Amh. II 154,2 and 8; 
P.Lond. II 1324,7), while others remain without any form of regularization 
(e.g. POxy. XVI 1898,19 and 38; P.Oxy. XIX 2238,18).356 This 
combination of different methods will inevitably lead to more 
inconsistencies within and between printed and digital editions in the 
future. 


4Standardization 


Past and current approaches have not resulted in a clear distinction between 
‘scribal error’ and ‘non-standard variant’ in the PN (see 3.1). The question 
remains whether it is possible to distinguish scribal errors from other types 
of variation and whether we should want to make such a formal distinction 
in (digital) editions (4.1). 

It has been shown that regularization of variation due to phonological, 
morphological and morphosyntactic changes is not always consistent (see 
3.2). Editors may use different methods to identify the norm and, 
consequently, these norms may differ from each other. In section 4.2, I will 
discuss various possibilities for establishing a standard for comparison. 


4.1 Scribal errors 


The traditional aim of textual criticism is the “Herstellung eines dem 
Autograph (Original) médglichst nahekommenden’ Textes”.357 Any 
corruptions to the text are caused by “the inability of scribes to make an 
accurate copy of the text that lay before them’”.35s Hence, any form of 
scribal intervention can be regarded as a mistake.359 Similar phenomena, 
such as misreading of the exemplar, orthographic variations and accidental 
alterations, occur in duplicate papyri, but not all documentary papyri are 
the result of copying.360 Therefore, our definition of scribal error has to be 
different from the one used for copying literary texts. 

Papyrus documents are the product of their own time and not the result 
of several centuries of transmission. Therefore, changes in the language do 
not need be regarded as scribal or copying errors in documents. Still, a 
division between scribal error and linguistic variation is commonly applied 
in linguistic approaches. Variation in the written language can be used to 
reconstruct changes in the history of the spoken language. In order to do 
that, significant variations, i.e. interchanges reflecting the spoken language, 
have to be separated from “Verschreibungen”361, “garbage errors”362 or 
“manifest blunders’363. Gignac identifies this difference between 
“phonetically significant variation” and “sheer mistakes and slips of the 
pen” by the principles of frequency and regularity: 


If certain letters or groups of letters interchange only rarely and irregularly, 
there might be another explanation. 364 


His other explanations include (a) anticipation and repetition, (b) inversion, 
(c) mechanical reproduction, (d) analogical formation and (e) etymological 


analysis.365 These examples of variation which occur irregularly and do not 
seem to reflect the spoken language can be described as ‘scribal errors’. 
Scribal errors of this type can usually be explained by common cognitive 
processes. 366 

Mechanical and cognitive processes may explain the appearance of 
scribal errors, but they do not constitute a comprehensive categorization or 
definition of the phenomenon itself. Haplography and dittography, for 
instance, are prime examples of the cognitive processes of anticipation and 
repetition (a). However, the simplification and gemination of consonants 
can also be explained by “the identification in speech of single and double 
consonants”.367 Hence, the example of “outright scribal error, e.g. 
ot[p]attedc for otpatnyéc” given in the Leiden+ documentation36s can 
also be explained by hypercorrective gemination of the consonant, the 
phonetic similarity of ¢ and y and the omission of y in the pronunciation as 
glide.369 Even the loss of a full syllable may sometimes have a phonetic 
explanation.370 Inversion (b) is another problematic category. Although the 
transposition of two letters may result “in spellings like atmoshpere which 
do not reflect an actual spoken form”, metathesis of a vowel and resonant, 
especially p, is relatively frequent in the papyri and may have had a parallel 
in speech.371 Mechanical reproduction (c) seems to identify a type of 
variation that is indeed limited to the written language, but the two 
remaining categories on Gignac’s list are not scribal errors strictly speaking 
either. Analogical formation (d) may not be caused by phonological 
changes, but can be indicative of morphological change in the spoken 
language, as is also acknowledged by Gignac.372 When a form can be 
explained by changes in the spoken language (phonological or 
morphological), it should not be classified as a scribal error according to 
the definitions mentioned above. Etymological analysis (e), such as the 
spelling of ék- in compounds before a voiced consonant, may not be 
relevant for the actual pronunciation of the word in later periods, but this 
change in orthographic conventions is better classified as orthographic 
variation than as a mechanical scribal error. 

Mechanical scribal errors in papyrus documents have received little 
study in their own right. Negative definitions prevail in the secondary 
literature aiming at the reconstruction of the original text or the spoken 
language. Gignac gives an excellent introduction to his method, but his 
overview of orthographic variations that are not phonetically significant 
cannot be used as a typology of scribal errors in documentary papyri.373 
Editors should feel free to discuss causes for variation in their 
commentaries and digital editors might want to continue experimenting 


with these distinctions, but it would be better to treat possible scribal errors 
in the same way as other types of variation in order to secure stable future 
reference to all variant forms. 


4.2 Different standards 


Regularization implies the use of a standard. Every editor who regularizes 
the language found on a papyrus compares the attested words and 
constructions with a certain norm. How and why this norm is chosen is 
usually not stated explicitly, but can be inferred to a certain extent from the 
patterns of regularization observed above (see 3.2). There seem to be two 
main sources for comparison: 
1. external sources, such as rules described in dictionaries, grammars 
and text books, and 
2. internal sources, such as other instances in the text itself or parallel 
texts that are ideally closely related in contents and context. 


Regularization to voooxopsiov, for example, was probably based on 
external criteria in most instances, since this spelling is rarely found in 
contemporary papyri. The spelling of ic, on the other hand, may have been 
left without regularization in the ostraka from Mons Claudianus based on 
comparison to other ostraka from the same area. As long as the attestations 
found in close parallels corroborate the external standards, editorial 
regularization of variant spellings tends to be consistent. As soon as both 
variants seem to be in regular use in contemporary papyri, such as with 
BipALlopvAdK(eyiov and repe@v(e)a, different editorial principles and 
methods may lead to conflicting results. 

It is not true that classical norms were especially used in the early days 
of papyrology and comparison with contemporary documents is an entirely 
new phenomenon. Variation in regularization practices is particularly 
common in early papyrus editions and classical norms are not consistently 
applied at all (cf. 3.2). Recent studies of the language of the papyri, often 
from a variationist perspective, have raised awareness of the possibility that 
scribal variation could be explained by its context.374 This may have led 
some editors to consider more context-sensitive methods, but also more 
practical considerations may have prevented editors from regularizing 
spellings that occur very frequently in a specific group of documents. The 
variationist idea that linguistic variation is dependent on its context is not 
an entirely new concept to papyrologists. The principle of comparison with 


parallel texts for understanding and supplementing another papyrus has 
been in use for a long time. In order to interpret the language used in 


papyri, Youtie suggests the use of dictionaries, grammars and “an 
unremitting search for parallels’”.375 He further notes that 


U. Wilcken has somewhere characterized papyrology as a “Parallelenjagd”. No 
term could be more apt. A good share of the papyrologist’s working time is 
devoted to searching for parallels.376 


Parallel examples are essential for a papyrologist to get familiar with the 
language and contents of different types of documents, to date the text and 
to identify the standard clauses used at different times and places.377 Even 
though this method has been used for many years to interpret new texts and 
to supplement words and phrases in fragmentarily preserved papyri, it is 
not always deemed suitable as a standard for linguistic comparison. 
Classical orthography and morphology are often understood to be the only 
proper standard for the language used in regularizations, in supplements of 
abbreviations and in lacunae.37s Kapsomenakis already voiced his concerns 
about the artificial norms that tended to be applied to the Greek language in 


papyri: 


Ubrigens hat eine volksmafig frei entwickelte Sprache ihre eigenen Gesetze, 
denen sie folgen mu, wenn sie ihre Aufgabe, der praktischen Verstandigung zu 
dienen, erfillen will. Die Vulgarismen diirfen also diesen Gesetzen nicht 
widersprechen. Weiter hat die Verkennung der Rechte der Volkssprache dazu 
gefihrt, daB man viele Schreiberfehler entdeckte, die man nach der Methode 
beseitigen zu miissen glaubte.379 


Classical Attic norms continued to be used as the standard for orthography 
and morphology in post-classical periods, but it seems difficult to justify 
applying anachronistic norms in cases in which a variant form is frequently 
or even normally used in Koine Greek. Lack of the awareness of the norms 
for the language used in papyri can easily lead to misplaced regularizations, 
reconstructions and even readings.380 The discrepancy between classical 
Attic and contemporary usage as the norm for editorial regularization is 
probably caused by a general lack of information about contemporary 
norms, as has also been pointed out by Youtie: 


But it is perhaps lack of linguistic information which trips us most often. 
Sometimes this takes the form of insufficient regard for the general laws of 
Hellenistic Greek, sometimes it is simply failure to search out the similar 
passages which are available in other papyrus texts. Whatever its cause, it has a 
crippling action capable of twisting our texts into fantastic shapes.381 


Knowledge about Koine Greek in general and the linguistic norms applied 
in papyri in particular are essential ingredients for a good papyrus edition 
and may help to prevent many reading errors and problematic restorations. 
On the other hand, the standards for orthography, morphology and 
morphosyntax in Koine Greek have still received little attention in research 
to date and there is no reference work that editors can use to identify a 
standard for every word or construction. These norms can, therefore, only 
be identified by manual comparison among a selection of documents. This 
creates the typical gap between the use of external sources based on 
classical Greek and the contemporary internal evidence. 

Koine Greek has never become a general standard for editorial 
regularization, although there are some exceptions. An example of such a 
well-known orthographic norm in Koine Greek is the spelling of the verbs 
yi(y)vonat ‘to be, to become’ and yt(y)v@oKw ‘to know’. Mayser and 
Schmoll state that the spellings yivopa1 and ywwomoxm are used without 
exception in the Ptolemaic papyri and Gignac confirms that these are also 
the normal spellings in the Roman period.3s2 Accordingly, the spelling 
yivouat is usually not regularized and the Koine Greek spelling is used in 
most supplements of abbreviations of the verb.383 Still, regularization to 
yiyvouat is found in about a dozen editions (e.g. P.Bodl. I 17,1,9; P.Haun. II 
22,5; P.Oxy. LXIV 4441,x,27; P.Petra I 4,5) and has occasionally been 
added to digital editions as well (e.g. O.Claud. IV 798,6; P.Stras. VIII 
772,6, 9, 15 and 21). In contrast to the relatively limited number of 
regularizations of yivopot, the verb ywwwoxm has been regularized to 
ylyv®oka@ in more than a hundred instances. Most of these regularizations, 
however, concern verbs with other spelling irregularities (almost 90%), 
such as yewaoxkw to yryv@okew (e.g. P.Col. X 278,4; SB XXIV 16290,2 
and 16291,4).384 When regularizing these other aspects, the idea of the 
classical standard seems to have overruled Koine Greek spelling 
conventions. The verb yivoua is also frequently spelled as yeivoum, but 
this rarely provoked regularization to the classical spelling of the 
consonants. The fact that the spelling of the verb yivopa often serves as the 
prime example of language change in Koine Greek, may have convinced 
editors to take the Koine Greek spelling as the standard for this verb more 
often.3s5 The differences in regularization between these two comparable 
verbs clearly illustrate the competing principles of regularization. 


5Towards a new approach 


In the previous sections, I have illustrated the various practices and 
principles for editorial regularization as they have been used up till today. 
Editorial regularizations in the apparatus are used to indicate phonological, 
morphological and morphosyntactic variation (3.2), while critical signs, 
such as angular brackets and braces, are mainly used by the editors to mark 
the addition and omission of one or more words (3.1). When brackets and 
braces are applied to single letters or parts of words, their function largely 
overlaps with the regularizations in the apparatus. More study is needed to 
separate accidental scribal errors from other types of variation in the papyri 
(4.1). The same applies to establishing contemporary standards for Koine 
Greek (4.2). Both goals are worthwhile pursuing in separate studies in 
order to gain a better understanding of the use of language in papyri, but 
such a distinction between different types of variation or different standards 
might not be essential for establishing a more consistent practice of 
encoding linguistic variation. 

The question comes down to what we would like to achieve with 
editorial regularization in papyrus editions. Are we trying to correct 
accidental scribal mistakes in the way the scribe would have wanted to? 
Are we normalizing the language to conservative or contemporary 
standards? Or are we just helping the classically schooled modern reader to 
understand a text written in a different variety of Greek? This last idea was 
probably an important reason to start providing standard Attic equivalents 
in the apparatus, as Turner explains: 


The critical apparatus [...] can also usefully show how the editor understands 
his text. The word ‘read’ or symbol /. = ‘/ege’ need not mean that the Greek is 
incorrect: it is a sign of how it can be interpreted in terms of standard Attic 
Greek.386 


The fact that Turner has to explain what is not meant by this sign 
immediately points out that the use of the word “/ege” can be misleading. 
The command “read” is easily interpreted as a correction rather than an 
equivalent. This inherent ambiguity is worth noting here. Other, more 
appropriate, signs should be considered for future printed editions. For 
digital purposes, however, it would be better to take a different approach 
altogether. As the apparatus shows ‘how the editor understands his text’, 
the ideas about what should be explained in the apparatus and what not can 
differ significantly from one editor to the other. Some editors may follow 
this practice very strictly and always provide standard equivalents, whereas 
others may think that this is only necessary for forms that are less common 
and more difficult to understand for the reader of the edition, such as in the 


earlier editions of the Oxyrhynchus papyri. This results in a pragmatic and 
fluid norm for encoding variation. 

Fluid norms are not ideal in a digital environment. That is why it was 
attempted to make regularization more consistent in the DDbDP and in the 
Papyrological Navigator. Modern editors and the methods designed for the 
Papyrological Editor have succeeded in standardization of editorial 
practices in digital editions to a large extent, but consistent regularization is 
not always a straightforward procedure, as I have shown in the sections 3.2 
and 4.2. Variation may be governed by various factors and this means that 
the chosen method for regularization may sometimes determine the 
outcome. This causes problems, because there are no guidelines describing 
a particular methodology for regularization in papyrus documents. 

Digital technology, however, can do more than standardizing the 
practices of printed editions. I can identify three main aims for encoding 
linguistic variation in papyrus editions: 

1. to show readers of the edition how the editor interprets uncommon 
forms, 

2. to help papyrologists to search for parallels of words and phrases in 

various spellings, and 

3.to provide useful data for linguists studying the Koine Greek 

language. 


The current practices in editorial regularization and the search interface of 
the PN do not fulfil each of those aims equally well.387 The traditional 
method of regularization is not suitable to encode variation consistently and 
objectively. In order to achieve objectivity we need to apply the same 
treatment to all forms rather than to rely on the judgements of individual 
editors to identify which forms are ‘uncommon’ enough. One could do this 
by providing a reference to a headword, i.e. a lemma, to every word that is 
attested on a papyrus. It is already possible in EpiDoc to annotate a 
‘lemma’ attribute to every linguistic ‘token’.38s A hyperlink to a lemma can 
be very helpful for less experienced users and additional morphological 
annotation would give an opportunity to the editor to explain how every 
form should be interpreted. The encoding of lexical and/or morphological 
information for every word can be similar to the creation of onomastic and 
prosopographical references to every proper name.38s9 The lemma could be 
in classical orthography, as it is not meant as a correction or regularization, 
but only as a reference point for all variant spellings. A search query would 
yield all attested variants of the lexeme or morpheme in question. Such an 
overview of attested variants and their chronological and geographical 


contexts will show which form might have been in common use at any 
given time. Full text search queries should ideally separate between the 
results based on real attestations on a papyrus and results including 
supplements of abbreviations and restorations in lacunae added by editors. 
This will provide the papyrologist with a more realistic picture of the 
language used in papyri and this will benefit new editions in the future. 
Marking linguistic variation is not a bad idea in itself, nor is the attempt 
to standardize editorial practices in digital editions. However, in order to 
reach the full potential of these approaches, they need to be applied more 
rigorously and more objectively. Editorial regularizations that have been 
annotated up to now should not be discarded, but can be used to establish 
automatic recognition of the lemmata and their potential variants. Once 
such a digital tool is functioning properly, only the most uncommon forms 
would still need to be annotated manually, comparable to the original 
practice of regularization. There are different technological solutions and 
several possible platforms that would be suitable to achieve these aims. 
Until that moment, papyrologists and linguists will be able to explore the 
rich source of linguistic variation available in Trismegistos Text 
Irregularities, a collection of a long history of editorial regularization. 
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1Introduction 


There currently exist two main digital repositories for papyri: the well- 
known one of the Jntegrating Digital Papyrology (IDP) Project (http:// 
papyri.info) and the most recent one of the project Digital Corpus of 
Literary Papyrology (DCLP) (http://www.litpap.info). Both corpora, which 
can be downloaded from GitHub (https://github.com/papyri and _https:// 
github.com/DCLP, respectively), contain papyrological texts encoded 
following the de facto standard EpiDoc schema, a subset of the TEI schema 
specifically designed for encoding ancient documents390. 

The notorious complexity of papyrological texts is also reflected in 
their digital encoding, which challenges the digital humanist in many 
respects. The major problem is represented by the fragmentary nature of 
most papyri391. Papyrologists work to integrate texts to the best of their 
knowledge: sometimes just a few characters are missing, while other times 
full words need to be supplied, for a text to become meaningful (and even 
integrating single words is often not enough to understand a text). The 
degree of certainty for a given integration, then, varies depending on a 
plethora of factors, including especially the papyrologist’s expertise. 

The challenge posed by text integration deeply affects the TEI/EpiDoc 
XML encoding of texts, where specific elements, such as the supplied and 
unclear ones, are used to mark up additions. In both corpora, the markup is 
added inline (instead of standoff)302. As a consequence, since 
fragmentation in papyri often occurs at the word level, the XML markup 
can break up words: this phenomenon, as is known, raises computational 
issues when it comes to tokenizing while attempting to preserve a link to 
the information contained in the markup of the original document. 

In this paper, I document a first attempt to add morphological 
annotation and lemmatization automatically to all Papyri.info documents. I 
restrict my present contribution to this repository because this is the only 
one containing some texts which have also been manually annotated in the 


Sematia treebank393. The Sematia texts provide us with a gold standard 
which can be used to measure how well the MATE tagger394 (trained on 
literary texts) and a rule-based lemmatizer are expected to fare with respect 
to my automatically annotated corpus, which will henceforth be referred to 
as the MALP corpus (= M(orphologically) A(nnotated) (and) 
L(emmatized) P(apyri) corpus), available at https://github.com/gcelano/ 
MALP. The MALP texts preserve the URNs of the original documents and 
the line break reference system for each token. Because of the complexity 
of the TEI inline markup in the Papyri.info files, I have not attempted to 
create stand-off annotations39s. 

It goes without saying that having the possibility to search the corpora 
using morphological features and lemmas will positively impact our 
knowledge of the texts. This holds particularly true for a corpus containing 
a morphologically rich language such as Ancient Greek, which cannot be 
easily queried using simple graphic words. Moreover, morphological 
annotation and lemmatization dramatically speed up the treebanking 
process, the annotators taking advantage of some annotation being already 
present and correct to a large extent. 

Since the Papyri.info corpus currently amounts to 62,901 documents, 
the only doable way to add morphological annotation and lemmatization to 
all tokens is to rely on an automatic annotator. Currently the best 
performing POS tagger available for Ancient Greek is the MATE tagger, 
which has been trained on the literary texts of the Ancient Greek 
Dependency Treebank (AGDT)396. The accuracy measured is 88%397. 
Comparable accuracies have been reported for the recent UDPipe tagger39s, 
which however adopts an annotation schema different from that of the 
AGDT and the Sematia treebank. Lemmatization will be performed using a 
tule-based approach relying on the Morpheus morphological analyzer/ 
dictionary399, from which lemmas can be extracted by linking it to the 
papyrological texts via word forms + fine-grained POS tags. 

In Section 2, I will present the Sematia treebank, which contains 
morphosyntactic annotation for some of the papyri contained in the 
Papyri.info corpus: they will serve as a gold standard to evaluate the 
accuracies of POS tagging and lemmatization. In Section 3, I detail the 
sentence-split and tokenization of the papyrological texts. Section 4 shows 
how the morphological annotation has been added to each token, while 
Section 5 details the lemmatization process. Section 6 contains conclusive 
remarks. 


2A gold standard for linguistic annotation of 
papyri: the Sematia treebank 


The Sematia treebank (https://sematia.hum.helsinki.fi/user)400 provides us 
with some semi-automatically annotated papyrological texts, which can be 
used to evaluate the accuracies of the POS tagging and the lemmatization 
of the MALP corpus. The treebank currently contains 224 papyrological 
texts annotated for their morphology (semi-automatically) and syntax 
(manually) according to the Guidelines for the Annotation of the Ancient 
Greek Dependency Treebank 2.012, which were first designed for the 
annotation of the AGDT. The original texts come from the Papyri.info 
corpus, but no criterion for the choice of the kind of text is at the moment 
followed in the treebank. 

The morphological annotation and lemmatization process are 
performed using the Morpheus morphological analyzer, which was also 
used to help the manual annotation of the AGDT texts. Since the MATE 
tagger has been trained on the texts of the AGDT, both the Sematia texts 
and the output of the MATE tagger are directly comparable without 
conversion. 

The Sematia treebank contains two layers of annotation for each 
papyrological text: the “original” layer, which only consists in the linguistic 
material that has been preserved in a papyrus, and the “standard” layer, 
which is built on the “original” layer and integrates it with editorial work 
aimed to make the text intelligible. Our automatic annotation is evaluated 
only against the texts of the standard layer: they provide a much better 
input for the POS tagger, the text having been integrated with missing 
characters/words. 

Moreover, while the Sematia treebank contains different annotation 
files for each hand recognized within a papyrological text, I provide only 
one tokenization per text in the MALP corpus, considering the 
contributions of different hands as part of the same text, on a pair of any 
other integration. These texts were not used for comparison purposes to 
simplify the evaluation process (more precisely, 17 standard layer Sematia 
texts have been excluded from the evaluation phase, i.e. the ones with the 
following URNs (contained in <title/>): o.claud.1.139, o.claud.1.148, 
o.claud.2.227, upz.1.7, p.grenf.2.15, upz.1.59, and bgu.3.994. Moreover, 
the Sematia texts with URN cpr.30.28, o.claud.1.131, and o.claud.1.135 are 
also excluded in that the Papyri.info counterpart of cpr.30.28 does not 
contain terminal punctuation marks allowing (automatic) sentence 


detection, while o.claud.1.131 and o.claud.1.135 contain Latin texts. 

The number of Sematia texts which can be used for comparison with 
the ones automatically generated for the MALP corpus is therefore reduced 
to 92 (i.e., 112 standard layers texts — 17 — 3): they are henceforth referred 
to as the Sematia comparison corpus, which is compared with the MALP 
comparison corpus, i.e. a subset of the MALP corpus consisting in the 
exact same texts but sentence-splitted, tokenized, POS tagged, and 
lemmatized by using the algorithms developed for this study (available at 
https://github.com/gcelano/MALP). 

The Sematia texts have been sentence-splitted and tokenized using the 
Arethusa annotation tool. Punctuation in Sematia texts has been partly 
edited manually, so the number of sentences is necessarily different from 
that one gets by simply sentencesplitting the original texts found in the 
Papyri.info corpus. More precisely, the Sematia comparison corpus 
contains 486 sentences, while the MALP comparison corpus 461 sentences. 

The intersection between these two sets of sentences is represented by 
204 sentences which are exactly the same, i.e. contain the exact same 
tokens (with the exclusion of the elliptical tokens manually added by 
annotators in the Sematia treebank, which are simply ignored). I will use 
these 204 sentences to evaluate the accuracy of POS tagging and 
lemmatization for the MALP corpus. Even if the comparison corpus is very 
small, it will provide us with a rough preliminary estimate of the quality of 
the POS tagging and lemmatization for the MALP corpus, which can be 
easily calculated automatically. 


3Sentence split and tokenization 


Sentence split has been achieved using a rule-based algorithm identifying 
sentences on the basis of presence of the following terminal punctuation 
marks: the period, the semicolon (which, in Ancient Greek, has the value of 
a question mark), and the dot above the line (whose function is comparable 
to that of the colon and semicolon in English). Importantly, some ancient 
punctuation marks are encoded in the text as XML elements, such as <g 
type="mid-punct"/>. I have converted all <g type="mid-punct"/> into dots 
above the line, which serve here just as generic terminal punctuation marks. 

In the Papyri.info corpus there is a huge variety of g elements, which 
are used to encode glyphs401. Their meaning is not always clear with 
respect to sentence-splitting, and so they are ignored in the present study. A 


—" 


refinement of sentence-splitting will be possible by modifying the XQuery 
module at the relevant points provided at https://github.com/gcelano/ 
MALP. 

The XQuery algorithm needs to also be improved to correctly 
distinguish periods used as final punctuation marks and periods used as 
abbreviation punctuation marks: e.g., a single letter followed by a period is 
taken by rule as an abbreviation (and so the period is not tokenized). 
Papyrological texts can however contain a considerable amount of Ancient 
Greek numbers (expressed via single alphabetic characters), which are mis- 
tokenized if they are followed by a period at the end of a sentence. 

More in general, punctuation in papyri is a very complex matter, which 
should definitely receive much more attention than the one so far received 
both in the TEI/EPIDOC XML text encoding process and in the present 
study, where only ‘standard’ punctuation marks already present in the 
edited Papyri.info files have been taken into consideration, without any 
attempt of developing a more complex system for sentence detection. This 
has as a consequence that sentence-splitting in the Sematia treebank, which 
has been manually checked, is arguably more accurate (even if not 
uncontroversial). 

Tokenization has also been performed using a rule-based approach. 
Ancient Greek graphic words (i.e. space-separated words) are commonly 
treated as separate tokens. There are only a few arguably rare exceptions to 
this statement: e.g., crasis can be responsible for the phonetic and graphic 
merging of an article and a noun, as in OjyEépa (= TH Mwépa). The algorithm 
does not attempt to provide a solution for that (and the POS tagger itself 
has been trained on data where other-than-space-based tokenization is not 
always consistent). 

Since the TEI/EpiDoc XML files contain inline markup, one major 
tokenization-related task is to distinguish the ‘data text’ from the ‘metadata 
text’. For example, the TEI note element contains some additional 
information about a part of the text, as shown in the file o.amst.8.xml 
(within the Papyri.info repository), where, with respect to a few characters, 
the <note> element specifies “Writing perpendicular to main text”. The text 
node of the note elements should clearly not be tokenized, not belonging to 
the main text. 

A few TEI elements can contain textual content which has been added 
by an editor and can be considered as part of the text: e.g., the <supplied> 
element allows integration of missing characters. In jurpap.36.xml, for 
example, the textual evidence éxiuxaAovpévn is directly followed by the 
element <supplied reason= " lost">c</supplied> to signal that a final sigma 


got lost but is necessary to properly understand the text. A more complex 
case is represented by the element <choice/>, which is designed to allow 
different readings in a_ text4o2: for example, <choice><reg>tfjc</ 
reg><orig>tiv~</orig></choice> on line 11 in jur.pap.36.xml shows that 
two different variants of the feminine article are possible at that point of the 
text, being one the original text on the actual papyrus and the other one the 
modern ‘regularization’ of the non-‘standard’ spelling. When tokenizing, 
one should integrate the variants into the text one by one. 

All the details about the preprocessing of a papyrological text aimed to 
clean it up before tokenization are contained in the XQuery module. More 
precisely, the function Ip:clean-markup() deletes those elements which do 
not contain data text (such as the <note> or <bibl> elements) and shows the 
logic followed as for those elements containing mutually exclusive data 
text alternatives, such as the choice element. The XQuery module also 
details the rules for the tokenization itself. One tokenization problem which 
has not been dealt with in the present study is the case of those words 
splitted on different lines, as in o.narm.3.xml, where the word Kodakeiac is 
splitted into KoAa (linebreak 2) and xsiac (linebreak 3). The XQuery 
algorithm treats the two divided parts as different tokens, in that they are 
identified by a different line break number (@n in <lb/>), which is taken to 
be a major identifying reference point for a token (i.e., a token cannot span 
over two linebreaks). Future work should try to address this problem. 


4POS tagging 


POS tagging is the NLP task aimed to add morphological analyses to 
tokens. A model for the MATE tagger has been trained on the AGDT, so it 
is possible to tag Ancient Greek texts automatically. The model has been 
trained on literary texts, i.e. texts whose language is quite different from 
that of papyri. 

The token accuracy for POS tagging using the MATE tagger has been 
compared to the baseline accuracy consisting in assigning to each token the 
most frequent morphological tag associated to it in the AGDT. As 
explained in Section 2, the comparison is performed using the 204 
matching sentences of the Sematia/MALP comparison corpus as a gold 
standard. The total number of comparable tokens is 1,839. The word forms 
of the tokens of each sentence have been compared as for their POS tags 
and morphological features. 


Baseline MATE tagger 
0.56 0.62 


Tab. 1: token accuracy for POS tag + morphological features. 


Baseline MATE tagger 
0.67 0.76 


Tab. 2: token accuracy for POS tag only. 

In order to calculate the baseline accuracy we used the most frequent tags 
found in the AGDT and, if a word form is not present, the first POS tag + 
morphological features found in the Morpheus dictionary for that given 
word form. There is no clear order of entries for each token in the 
Morpheus dictionary, so this latter POS tag + morphological features 
assignment can be considered random. 

The tables above show that having a POS tagger both for POS tags only 
or POS tags + morphological features is useful to get better accuracies, 
even if the figures are admittedly not very high. 

One explanation for these low accuracies is that the vocabulary found 
in papyrological texts is very different from the one found in the AGDT: 
the distinct values of the tokens of the 204 matching sentences contained in 
the Sematia/MALP comparison corpora are 781, of which only 361 are 
found in the AGDT data (used to train the MATE tagger). Similarly, the 
Morpheus morphological analyzer can recognize only 395 tokens (out of 
the 781 distinct values). This is clear evidence that papyrological texts 
contain very different vocabulary from literary texts, which deserve special 
attention and development of specific resources. 


5Lemmatization 


An attempt of lemmatization is performed using a database consisting of 
entries from the Morpheus morphological analyzer/dictionary and the 
Perseus-Under-Philologic morphological dictionary (available but not 
downloadable at http://perseus.uchicago.edu), which is a refinement of 
Morpheus, with correction of its errors and addition of new entries. 

A typical entry of the database is like the following: 


<d n="500083" v="a-s---fa-#ayuetakivntov"> 
<p>a-s---fa-</p> 
<f>dauetakivntov</f> 


<| r="a--s---fa-">dyeTakivntoc</|> 
<e>dueTakivntoc</e> 
</d> 


Each d element is identified by a word form + morphological analysis, 
which a lemma from the Perseus-Under-Philologic dictionary (in the 1| 
element) and/or the Morpheus dictionary (in the e element) are associated 
to. I extracted from this database all the lemmas of those entries whose 
word forms and morphological analyses correspond to those of the MALP 
tokens (belonging to the 204 matching sentences). 


Lemmatization 


0.47 
Tab. 3: token accuracy for lemmatization. 


The result of the comparison between the lemmas in the Sematia 
sentences and the ones in the MALP sentences (containing lemmas added 
using the above mentioned database) is as low as 0.47. This result seems to 
be due to the low accuracy of POS tagging because the lemmatization 
process heavily relies on it: if the morphological tag associated to a word 
form is wrong, then the correct lemma is unlikely to be retrieved from the 
database. Another reason can be the fact that the vocabulary covered by the 
dictionaries is not complete: however, if the Morpheus dictionary only 
recognizes 395 tokens of the distinct values of the Sematia/MALP 
matching sentences, the Perseus-Under-Philologic dictionary has a much 
better coverage, which amounts to 582 tokens. 


6Conclusions 


In this paper I have documented an attempt to add morphological 
annotation and lemmatization to the Papyri.info corpus automatically. The 
results show that finegrained POS tagging accuracy is 0.62, while 
lemmatization accuracy is as low as 0.47. The fine-grained POS tagging 
accuracy is likely to be explained because of the vocabulary difference/ 
coverage between the literary texts of the AGDT, which the MATE tagger 
has been trained on, and the papyrological texts (the AGDT recognizes 
only 361 tokens of the 781 distinct tokens of the Sematia/MALP matching 
sentences). This has a direct consequence on the lemmatization accuracy, 
which has been performed using a rule-based approach depending on the 


POS tagging: if the association word form-POS tag is wrong, then it is very 
unlikely that the right lemma can be retrieved. In order to get better results 
both for POS tagging and lemmatization, a papyri-specific treebank, such 
as the Sematia treebank, should be expanded, so that it can provide enough 
data for the training of new models (for POS tagging/lemmatization/ 
parsing). Even if the accuracies calculated for the MALP corpus are not 
high, still the corpus is expected to be useful for data exploration. 
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1Introduction 


Texts are made of words, the meaning of the words creates the text, so that 
understanding the words means understanding the text. This assertion may 
sound obvious, but it actually hides a deep truth: reaching a more exact 
definition of the words can help to reach a more exact meaning of the 
whole passage in which they occur. Hence, the need for creating tools 
specifically conceived to study the words and their tight connection with 
the texts. In the digital and internet-led era we are living in, electronic 
technologies have been profitably applied to the Humanities and intersect 
with them in the scholarly field of the Digital Humanities (DH) to such an 


extent that it is hard to disagree with Jerome McGann’s incisive words: 


As with the Renaissance sped forward by the printing revolution of the fifteenth 
century, digital technology is driving a radical shift in humanities scholarship 
and education. The depth and character of the change can be measured by one 
simple but profound fact: the entirety of our cultural inheritance will have be 
reorganized and re-edited within a digital horizon.403 


In this context, online dictionaries and reference tools, as results of an e- 
lexicographic process,404 may acquire a special relevance and potential, 
notably if these resources are linked to a corpus of digital editions of texts. 
This is precisely the case with Medicalia Online, a digital lexicographical 
database of technical terms attested in the Greek medical papyri.405 
Medicalia Online is indeed strictly connected to the digital editions hosted 
in the Corpus of the Greek Medical Papyri Online (CPGM),406 the digital 
library of ancient medical texts on papyrus recently merged into the Digital 
Corpus of Literary Papyrology (DCLP).407 Like the CPGM project, 
Medicalia Online has been developed at the University of Parma (Italy), 
from 2014 to the end of 2016, in the framework of the ERC project 
DIGMEDTEXT funded by the European Research Council (Grant 
Agreement no. 339828) and directed by Professor Isabella Andorlini.4os 
Thus, given the close interconnection with the core database, Medicalia 
Online can be considered as both a supplement to and an expansion of the 
digitized corpus of the Greek medical papyri, as I will illustrate below. 


1.1 matter of definition 


Medical papyri represent a corpus of peculiar texts with a peculiar nature, 
that ranges from literary texts, notably treatises by known authors and 
adespota, since papyrus fragments not rarely preserve works more or less 
of the same status as the medical literature transmitted in medieval 
manuscripts, to technical texts conceived to convey technical knowledge, 
for instance technical handbooks, collections of recipes, school manuals 
and catechisms, to proper documentary texts, such as public physicians’ 
reports, petitions of private individuals and private correspondence 
concerning matters of health and diseases. So, besides the strictly literary 
texts, such a corpus mostly includes texts with a ‘borderline’ character, viz. 
combining features of papyrus documents with issues proper of the 
technical nature of medical writings, that are categorized — as in a sort of 
‘twilight zone’ — as ‘paraliterary’ or — in a vaguely pejorative way — 


‘subliterary’.409 Due to this complex and stimulating textual situation, it 
became clear since the beginning that a dictionary of short definitions of 
terms would not have fitted the exegetical requirements of ancient medical 
discourse. This entailed the necessity to broaden the goal of Medicalia 
Online to produce a rigorous and detailed reference collection of relevant 
lemmas critically discussed. An extensive and diachronic treatment of 
ancient Greek medical terms was indeed still missing from the scholarly 
landscape. It was decided to focus the attention on a selection of specimina 
with the aim of providing not merely brief explanations of many words, 
like in an ordinary ‘dictionary’, but a series of in-depth studies on selected 
terms. As a consequence, it is preferable to define Medicalia Online not as 
a simple ‘glossary’ or ‘dictionary’, but rather as a ‘lexicographical tool’ 410 
containing entries or articles with an encyclopaedic flavour, or, even more 
specifically, as a ‘specialized lexicographical tool’, being it devoted to the 
specific set of linguistic and factual elements of the specialist subject field 
of ancient medicine. 

Considering the vastness of the lexical material at disposal, since the 
papyri are a treasure-trove of linguistic information, the lexicographical 
process is still ongoing and potentially never-ending. Due to this aspect, 
Medicalia Online may fall into the category of the lexicographical tools 
“under construction” or better “dynamic’,411 according to the terms 
“Ausbauwo6rterbuch” (“dictionary under construction’) VS. 
“Abschlussworterbuch” (“completed dictionary”) introduced by 
SCHRODER 1997, 60, and ‘“dynamisches W6rterbuch” (‘dynamic 
dictionary”’) vs. “statisches Wo6rterbuch” (“static dictionary’’) preferred by 
LEMBERG 2001, 81. This means that Medicalia Online is not “a fixed 
object”, but a flexible entity, “an organic changing database”’412 that can be 
continually enlarged, as well as that its lexicographical process is an “open 
system’.413 Such a flexibility and the possibility of a constant improvement 
and updating represent the undeniable advantage of an online publication. 
Nevertheless, most of the entries already published online in the Medicalia 
Online database are going to be published also in print, in form of a 
collection of lexical studies, as part of a volume.414 


1.2Methodology and aims 


There are some keywords characterizing the methodology of Medicalia 
Online. The first and most important one is ‘interdisciplinarity’. The 
significance of interdisciplinarity in the new trends of Papyrology has been 


stressed several times in recent years. Suffice it to remember how often 
expressions like “broader concept” and “broader view”, as well as words 
like “combination” of sources, and, of course, “interdisciplinarity” occur in 
The Oxford Handbook of Papyrology edited by Roger Bagnall in 2009. So, 
whilst the core focus of Medicalia Online is papyrological and the evidence 
of medical papyri plays a leading role, a systematically interdisciplinary 
approach inspires the inner nature of its lexical studies. This contributes to 
broaden the horizon of the database to a wide range of perspectives, since it 
provides at the same time a papyrological, linguistic, archaeological and 
historical-scientific overview of the studied items. Such a methodology, 
indeed, involves a strong sense of dialogue and cooperation among 
disciplines, merging and combining components of several subject areas 
beside papyrology: classics and history of textual transmission, digital 
humanities, linguistics, epigraphy, archaeology and material culture, history 
of science and of medical practices across the ages. It entails a critical 
analysis and a comparative examination of all the typologies of sources on 
which the study of the ancient medicine draws upon, from the written ones, 
i.e. papyri, literary passages (first and foremost works on medical topic, but 
also any other author in which the terms appear), inscriptions and tituli 
picti, to the available archaeological discoveries attesting to medical 
practice. Ultimately, this integrated approach, which bridges together the 
main subject areas in ancient studies, enables us to throw new light on the 
complex and multicultural setting of the Greco-Roman medicine in Egypt, 
and presents — to borrow Vivian Nutton’s words — “an inclusive model for 
understanding the medical word of Antiquity”’.415 

The second keyword of Medicalia Online is ‘verticality’. Such a 
comparative and thorough, i.e. ‘vertical’, approach to the sources 
contributes to improve our understanding of the Ancient World with its 
textual and concrete aspects, with its verba and its realia, and promotes an 
essentially ‘vertical’ rather than ‘horizontal’ dimension in investigating 
lexical items. Thus, this sort of ‘archaeology of the words’ also translates 
into the effort to ‘revitalize’ the past, making it more accessible to the 
present time. 

Furthermore, starting from the evidence of the papyri, particular 
attention is devoted to the evolution and survival of the examined words 
into the modern languages and contemporary scientific discourse. Thus, 
one of the goals of Medicalia Online is to focus on the diachronic, often 
problematic developments of the Greek technical vocabulary tracing its 
trajectory from antiquity to modern times. 

A further aspect concerns the analysis of the relationship, viz. the points 


of divergence and contact, between the terminology attested by the papyri 
of medical content and the — often more sophisticated — technical language 
known through the medieval manuscript transmission of the ancient 
medical writers, from the Hippocratic authors to the compendiasts of Late 
Antiquity. In this view, the lexical studies of Medicalia Online allow us to 
explore the contribution of the papyri to our knowledge of the Greek 
medical language. 


1.3The database and the entries 


The database is built on the open source vocabulary server TemaTres and is 
browsable in different ways. The home page displays a_ threefold 
subdivision by macro-categories, each of which is further divided into 
subcategories providing a taxonomical classification of the terms: 
“Lexicalia’”, i.e. word typologies (e.g. containers, ingredients, instruments, 
termini technici), “Medical branches” (e.g. gynecology, ophthalmology, 
pathology, pharmacology, surgery), “Text typologies” (e.g. adespota, 
catechism, documentary texts, prescription). A single term can also be 
subordinated to two or more subcategories, so that it can be searchable in 
each of them. Another way is to browse the terms and the categories 
alphabetically by clicking a certain Latin or Greek letter either at the top or 
at the left bottom of the home page. Finally, on the very top, it is also 
possible to use a full-text search, as well as an “Advanced search”. In the 
latter case, a drop-down menu provides a submenu of navigation items to 
select the research scope: “Term” restricts the search to the headwords, 
“Meta-term” to the categories, “Non-preferred term” to secondary 
headwords such as variants and diminutives, and “Note” to the thematic 
boxes. 
The lexicographical structure of the lemmas is innovative and reflects the 
interdisciplinary, integrated approach that inspires Medicalia Online. The 
layout of the entries is conceived to offer a broad overview of the examined 
words and is essentially comprehensive, but, at the same time, it is user- 
friendly and applicable to any lexical category and semantic field. User- 
friendliness is indeed an important prerequisite when a lexicographical tool 
is complex and involves a conspicuous bulk of information. Each lexical 
entry consists of a fixed set of thematic boxes (“notes”), as follows: 
—Variants” includes a list of variants, both grammatical (e.g. diminutives) 
and phonetic/spelling variants as found in the papyri, the Latin 
transliteration or form(s) of the term, and the cognates of medical 


relevance, if any; 

—“General definition” gives a dictionary-like definition useful to provide 
the reader with the main information concerning the searched term and 
its ‘immediate’ meaning before (or in case) (s)he goes on reading 
through the full lemma; 

—“Language between text and context” is a linguistic section containing 
discussions on etymology, morphology, semantics, variants and 
cognates of the examined term, but it also discusses its linguistic 
history up to modern times and the diachronic developments of its 
technical meaning(s); 

—“Testimonia — a selection of representative sources” lists some Greek and 
Latin passages from all kind of written sources (literature, papyri, 
inscriptions) in which the term is attested, selected according to their 
medical relevance. Each passage is accompanied by an English 
translation; 

—“Commentary” is the most substantial section of the entry and is aimed at 
contextualizing the term in its textual and _historical-scientific 
background. In order to do this, the section is divided into two chapters. 
The first one (“[the term] and its medical sources”) traces a detailed 
overview of what the ancient sources attest about the term, also 
scrutinizing the possible changes of its semantic value over time from 
its earliest attestations to Late Antiquity, and the comparison between 
its ancient and modern meaning(s). The second chapter (“[the term] in 
practice”’) is specifically focused on the ‘practical’ side of the examined 
item and outlines the connection between the word and its concrete 
dimension. To make just some examples, this means the material 
reconstruction of the related object in case of words denoting res 
medicae, such as containers employed to prepare or store remedies or 
surgical implements, and the methods of treatment and surgical 
procedures to be performed when dealing with names of pathologies 
and disorders, with particular attention to parallels, divergences and 
innovations along the history of medicine; 

—‘Bibliography” includes “Lexicon entries”, i.e. dictionaries, glossaries 
etc., and “Secondary literature”, i.e. more extensive studies on that 
particular topic or word. 

—“CPGM/DDbDP reference(s)” lists the papyrological evidence containing 
the word. Since some of the examined terms occur only in the CPGM 
Online, while others appear also or only in documentary texts dealing 
with medical topics, such as private letters requesting remedies or 
surgical instruments, which are contained in the DDbDP, “the 


documentary evidence (‘DDbDP references’) will be linked to the 
appropriate texts on Papyri.info, the literary or paraliterary one 
(‘CPGM references’) to the forthcoming texts on DCLP, from which, in 
turn, it will be possible to insert links back to Medicalia Online’’.416 


Finally, a clickable list of the terms connected to the main term 
(diminutives, variants, cognates, Latin forms), which are also searchable 
through the alphabetical list that can be found both at the top and at the 
bottom of the home page, closes the lemma. 


2The lexicographical database and the digital 
editions of texts 


2.1The interconnection between Medicalia Online 
and the textual database 


The contribution of Greek and Latin papyri to our knowledge of classical 
languages is an indisputable fact that has been scholarly recognized since 
their discovery in the dry sands of Egypt in the late 19th century. It is worth 
quoting EVANS — OBBINK 2010, v: 


Every scrap of papyrus and every ostracon or tablet unearthed has the potential 
to change some aspects of the way we think about these languages. Such texts 
have the capacity to modify our understanding of the classical forms of both 
languages and for their post-classical development provide evidence of the most 
direct kind we shall ever acquire. The richness of the resource can hardly be 
overstated. 


Exactly like the other categories of papyri, papyri of medical content have a 
massive linguistic potential. The corpus of the Greek medical papyri has 
indeed not only enhanced our knowledge of medical literature and 
everyday medical practice, revealing valuable information on the diseases 
that affected people in the Egyptian chora, as well as their pharmacological 
and surgical treatment. It has also offered rich attestation of Greek technical 
vocabulary, its diachronic trajectory over time, its registers and levels of 
technicality, from the actual medical Greek written or spoken by medical 
professionals when communicating with their colleagues, to the not 
properly technical but still medical language used in everyday life by lay 


persons and practising physicians.417 

In addition to the new attestation of technical terms already well known 
by means of the ‘official’ medical writings transmitted through the 
medieval manuscript transmission, medical texts on papyrus often refine 
our knowledge of weakly attested or extremely rare words and bring back 
to light elements of Greek medical vocabulary previously lost and 
completely unattested in other medical sources. 

An in-depth study of medical micro-language and_ technical 
terminology is thus essential to deeply understand the texts. For this 
fundamental reason, to join the digital editions of the medical texts on 
papyrus with the lemmas in Medicalia Online can be of the utmost 
importance to promote an integrated and mutual enrichment. 

An even more in-depth investigation might be realized by adding to the 
lexical studies the analysis of the morpho-syntactic stucture of the texts by 
means of the application of different levels of linguistic annotation. 
Annotation is indeed a cardinal part of the linguistic analysis of a corpus of 
texts, the method of describing, recording and analysing linguistic 
phenomena through computer-based text corpora that is better known as 
Corpus Linguistics.41s As stressed by REGGIANI 2016, 2: 


A linguistic corpus is usually intended as a selection of sample texts 
representative enough of a language, and though the medical papyri at our 
disposal come from a random and incomplete selection, they can be considered 
as the entire reference population rather than as a sample of a larger group, so 
that linguistic annotation seems to me absolutely feasible. The basic annotation 
layer, related to the analysis of the parts of speech (the one also known as 
treebanking because it is usually represented with a tree graph) would allow to 
conduct an extensive lexical, phraseological-formulaic and syntactic analysis on 
the corpus, aimed also (but not only) at discovering styles and writing strategies 
specific of the medical texts, both literary and documentary: think only of the 
possibility to find out influences or interpolations between authors, or the 
presence of literary echoes in technical or documentary texts. To analyse in 
depth and comprehend the syntactic structure of texts would allow also to solve 
problems of interpretation, or even only to understand the exact meaning of a 
text. 


Thus treebanking, as it is used in linguistics, is a possibility to model how 
sentences are built by creating morpho-syntactic trees. In the field of 
Classics this kind of linguistic annotation is now at a very advanced 
level.4i9 Just to mention two relevant projects, The Ancient Greek and Latin 
Dependency Treebank (AGDT 2.0) is a corpus of ancient Greek and Latin 
literary works, annotated on the morpho-syntactic and semantic layers, 


which has been developed since 2006 at the Leipzig and Tufts Universities 
by Giuseppe G.A. Celano, Greg Crane, Bridget Almas and others,420 while, 
on the more strictly papyrological side, the project Sematia, conducted by 
Marja Vierros and Erik Henriksson at the University of Helsinki, is a 
platform aimed at facilitating the linguistic tagging of digitized 
documentary papyri through the creation of linguistic layers from TEI/ 
EpiDoc XML documents.421 

It is worth mentioning another innovative text analysis tool, which 

might be potentially useful also in the case of the corpus of Greek medical 
papyri. I am referring to CATMA 5 (Computer Aided Text Markup and 
Analysis),422 a tool developed at the University of Hamburg that offers an 
interesting combination of three main features, since it allows collaborative 
annotation and analysis of a text or a text corpus, it supports explorative, 
‘non-deterministic’ practices of text annotation, viz. a discursive and 
debate-oriented approach to text annotation based on the research practices 
of hermeneutic disciplines, and integrates text annotation and text analysis 
in a single web-based working environment. 
One of the main outcomes provided by the digital tools is the possibility to 
support several kinds of linguistic analysis -— lexical, semantic, 
morphological, syntactic — in direct interconnection with the texts, namely 
directly on the digitized textual editions. This allows an — even 
simultaneous — work of in-depth investigation, abstraction and 
conceptualization on and through the text itself, thus enhancing its deep 
comprehension and interpretation in an immediate, dynamic and interactive 
way. Immediacy, dynamism and interactivity are indeed among the most 
stimulating features and perspectives of a digital publication. In the case of 
Medicalia Online, its interconnection with the core textual database may 
exactly be defined with these adjectives: immediate, dynamic and 
interactive. The expected systematic connection of the examined words 
occurring in the digital editions of the medical papyri to the related lexical 
entries in Medicalia Online423 may help not only to refine the definition of 
a term in a particular papyrus text, but also to contextualize the term in all 
the other sources attesting it. Medicalia Online is, thus, so conceived as to 
integrate the textual database, in order to serve as an expansion and a 
supplement to the digital editions of the Greek medical papyri. 

The issue of the integration and interconnection with the main database 
puts the stress once again on the contribution of a close lexicographical 
examination to an enhanced textual understanding. In particular, a thorough 
lexicographical approach can strengthen a double awareness of the 
examined lexical item within the framework of the text, as it will be 


illustrated by means of some selected case-studies: ‘concrete’ on the one 
hand, ‘textual-philological’ on the other hand. 


2.2Lexical studies and ‘concrete’ awareness 


As already said, the interdisciplinary and critically comparative 
methodology of Medicalia Online enables us to explore also the practical 
and material dimension concerning the term and its context. This aspect 
makes it possible to almost physically ‘visualize’ the concrete reality under 
and beyond the words, especially thanks to the second section of the 
“Commentary” (C 2), which is explicitly focused on the concrete side of 
the lexical items (see § 1.3). Thus, this sort of ‘archaeology of the words’, 
aimed at ‘revitalizing’ the past, makes the ancient texts themselves more 
‘living’ and their words almost ‘tangible’. 

An example of an object virtually ‘reconstructed’ starting from the 
evidence of a papyrus is provided by the word bddpia, commonly denoting a 
particular type of container used as a jar to carry and pour water.424 This 
primary function, made clear by the etymology from téwp, “water”, is well 
established in the ancient sources. In the documentary papyri, this container 
is also filled with other contents, such as foodstuffs.425 Only two passages 
of medical authors refer to the bépia (and to the diminutive bdpickn) as a 
container of pharmaceutical use.426 In both cases, the vessels have the 
function of small containers for ointments employed for the storage of the 
therapeutic products prior to their use. This function is attested in just one 
documentary papyrus of medical content dating back to the late IV century 
CE, P.Oxy. LIX 4001. The papyrus is a letter written by a certain 
Eudaemon to his mother, grandmothers and a woman called Cyra. 
Eudaemon, who is a doctor, sends the letter to his surgery presumably in 
Oxyrhynchus, as the address on the back shows (d26d0¢ sic TO iatpEiov). 
He is working away from home for professional reasons, and he now asks 
for the means to make some medical implements on his own. He also 
notifies his family that he received a “hydria of eye-salves” instead of a 
“hydria of animal grease”.427 The passages of medical content do not 
provide any information about the physical appearance of the vessel, but 
the word bédpia corresponds to a well-recognized type of container in the 
(conventional) archaeological vocabulary, and it is one of the most 
common shapes depicted in Attic vase-painting, such as in the famous 
Francois vase (ca. 570 BC). According to all types of ancient evidence, the 
most typical features of this vessel are the presence of some handles 


(usually three: two horizontal side-handles for easy lifting and a vertical 
handle for pouring the water or carrying when empty) and a fairly narrow 
neck set off from the body. It is significant to stress that the comparative 
study of all the sources — written as well as archaeological — allows us to 
conjecture about the material shape of the vdépia in the Oxyrhynchus 
papyrus, as well as, more broadly, in the other two medical passages, even 
though it does not appear any explicit information about its morphology. 
First of all, although in these texts of medical content ddpia / Dépioxn does 
not represent an actual terminus technicus in the vocabulary for medical 
containers, it is likely that the word has been used in connection with 
remedies having an ointment-like consistency because of the shape and the 
considerable versatility and manageability of this small vessel. Assuming 
that the ddpia / bDdpiokn of the medical sources has a narrow neck and a 
vertical handle like some glass Aydriskai for cosmetics and oils found in 
archaeological contexts,428 its morphology seems 

to be particularly suitable for closing and sealing.429 C:\Users\Isabella 
\Dropbox\Medicalia Parma\HYDRIA-final version.docx Thus, it is highly 
probable that these features represent the main reason why in the papyrus 
letter from Oxyrhynchus a little vessel named bdpia has been chosen to 
contain the therapeutic products mentioned: the KodAovpia actually 
received by Eudaemon and the dévyyov previously requested by him but 
never dispatched. In all likelihood, these special details of the vdpia 
attended to by Eudaemon’s family will have assured proper preservation of 
the remedies during their transport from the iatpsiov in Oxyrhynchus to the 
village where Eudaemon was apparently working as a physician. 

A further example of this attempt to ‘revitalize’ and ‘reconstruct’ an 
object from the past may be represented by another name of container, 
pappaKko0nKn.430 The term is a semantically transparent compound 
employed to denote a portable chest or case for the storage of remedies and 
medical implements. The earliest witness of the noun is P.Oslo II 54,6 
(second half of the I — first half of the III century AD), a private letter on 
papyrus from Egypt (likely coming from the area of Oxyrhynchus), 
addressed by a certain Horeion to his father Apollonios. The term occurs 
otherwise exclusively in astronomical and Christian works from the V 
century AD onwards.431 The lack of this term from the medical texts is 
significant and might suggest that, even if the compound mappaKo8nKy 
indicates an exclusively medical container, viz. a technical accessory, it 
never developed into an actual terminus technicus. It is likely that it served 
as a lay synonym for the physician’s tool-case in common language, for 
instance in practitioners’ everyday conversations with their patients, instead 


of more ‘official’ and technical terms for similar objects used among 
professionals, such as vap0n&,432 a noun which is well known in literary 
sources, and explicitly defined vépO0n& iatpikdc in a literary papyrus 
containing alchemical recipes (PSI inv. 22011,48 = TM 65816), but also — 
though less attested — deAtdpiov,433 Eyyetpioiov,434 and mnpa.435 As a matter 
of fact, according to SCHIRONI 2010, 338: 


Technical terms often have lay synonyms in common language; this is particular 
evident in medicine where technical and lay terminology coexist [...] and often 
physicians use the latter in order to be understood by the patients. 


Thus, it seems that the compound mappakoOnKn might be placed at an 
intersection point between the language of everyday life often documented 
in the papyri and the technical vocabulary for medical containers. 
Furthermore, even though the extant written sources do not provide 
illuminating information about the shape and the physical appearance of 
pappaKkoOnKn, archaeological evidence may represent a fertile ground to 
start formulating hypotheses. Indeed, a certain number of cases for storage 
and transport of drugs and remedies and/or surgical implements have been 
unearthed in excavations. Among these, a characteristic typology is 
rectangular (on average 12 <x 6-7 x 2—3 cm), equipped with a sliding outer 
lid and internally divided into compartments, each having its own hinged 
cover in order to store together different medical substances with no risk of 
contamination. Assuming a terminological and typological overlapping 
between the PappaKkoOfKka1 mentioned in the written sources and any of the 
archaeological boites médicales, it is tempting to surmise that this 
compartmentalized rectangular type with sliding lid was the most suitable 
shape for the domestic pharmacy chest mentioned in the Oslo papyrus. This 
hypothesis relies on the association between the almost intuitive formation 
of the compound - literally a case for medicines (pdpyaKov + OAKn) — and 
the most common type of container with that function in both professional 
and non-professional contexts. Archaeological discoveries have offered 
several specimina dating back to the Roman period, therefore 
contemporary with the papyrus from the Oslo collection. Often very well 
preserved, and sometimes still holding residues of their pharmaceutical 
contents, some of these OfKo1 surely derive from a professional field, but a 
similarity in shape of their household counterpart may be likely 
presumed.436 

Finally, in both of these illustrative cases, an operation of ‘archaeology 
of the words’, consisting in integrating written and material evidence, 
contributes to shed new light on the (hypothetical but plausible) physical 


reality hidden behind the ancient words. 


2.3Lexical studies and ‘textual-philological’ 
awareness 


A thorough analysis of a term in all its written attestations can bear huge 
potential by improving our awareness of the ancient texts and their — 
sometimes highly problematic — philological issues. In-depth lexical studies 
like those offered by Medicalia Online can indeed give a considerable 
contribution to promote the contextual and philological interpretation of 
medical texts, sometimes even resulting in new readings and corrections to 
the established editions. The critical exegesis of these texts is strictly 
dependent on the understanding of their technical terminology, to an extent 
that reaching a more exact definition of the words expressing the contents 
can enhance the philological awareness of the textual context in which they 
occur. 

Furthermore, Medicalia Online allows both a ‘holistic’ and an ‘atomic’ 
approach to the object of study. The analysis is carried out in a ‘holistic’ 
way in the sense that the term is studied in all its occurrences in order to get 
the broadest possible overview of its meaning(s). At the same time, it is 
also performed in an ‘atomic’ way, so that the word is grasped in its 
verticality and all its details are accurately examined and understood. This 
operation encourages a ‘double movement’: from the text(s) to the word — 
the ‘holistic’ study of the term in all the written sources attesting it leads to 
a deeper comprehension of the term itself —, and from the word to the 
text(s) and the textual tradition. This thorough knowledge of the word 
entails a better understanding chiefly of the medical text on papyrus that is 
the starting point of the lexical analysis, but also, very often, of the 
passages — or at least of some of them — by medical authors that use this 
word. This implies a more refined philological awareness that can also 
enable us to amend errors in the manuscript transmission. 

To illustrate the contribution of an in-depth analysis of the technical 
terms to the exegesis of the medical texts — whether medical papyri or 
passages by medical writers — it may be useful to take as a specimen the 
observations raised by the study of the word xa8édptoc.437 KaBEdptos Is an 
adjective literally meaning “of or for sitting” (LSJ9 851 s.v.). It is derived 
from the name of the object that concretely receives the action of sitting, 
Kabédpa (“seat”). As to the word formation, Ka0édpioc is a denominative 
adjective in —toc, suffix of Indo-European origin usually employed to form 
adjectives from noun-stems and productive during the entire history of 


ancient Greek, from the Homeric language to the Koine.438 Like all the 
other derivatives of the noun KaGédpa,439 KabEdptoc seems to be quite late, 
since it does not appear before the II century AD. Its earliest extant 
attestations are two medical texts on papyrus, and this is the reason why the 
evidence of the papyri is particularly relevant in this case: P.Aberd. 11,10, 
an ophthalmological catechism of the II century AD (http://litpap.info/ 
delp/63332), and P.Ryl. II 529r,57 (and maybe II. 70-1), a fragment from a 
papyrus codex of the late II[ century AD containing a surgical treatise 
concerned with the treatment of shoulder dislocation (http://litpap.info/ 
dclp/59970). 

Starting from these early attestations, «a0édpiog is an adjective 
appearing only in medical sources to denote the sitting position of the 
patient, so that it seems to be a genuine terminus technicus of the medical 
micro-language. Its technical value is strongly confirmed by the medical 
authors, from Oribasius to Paul of Aegina, where the adjective is mostly 
used in the description of different kinds of surgical operations, but also in 
other medical contexts. The common formula is ka0édptog + 6 KaLVa@v / 
ma0y@v + oynpatifw vel sim., with Ka0édptoc, as a rule, in predicative 
position,440 but the adjective is also associated with the noun oy7jL0.441 

In order to contextualize the topic, it might be relevant to stress that 
placing the patient in the proper position has been an important prerequisite 
for surgery since ancient times. Like nowadays, different physical positions 
were required for different procedures. Therefore, it was essential to 
identify the correct position for any given operation. The importance of 
surgical positioning emerges, for instance, from a passage of the 
Hippocratic treatise De officina medici (Off. 2 = II 275-6 L.): 


Ta 5’ €G yElpovpyinv Kat’ intpsiov’ 6 dobEevéwv' 6 Spdv° ot bANpPETAL TH 
6pyava’ TO OAc’ Skov’ 6ka>c 600° 6KMs SKOV TO GALA, TA GpLEVa’ O YPdvOc' O 
TpOMOG’ O TOMOG, 


in which the adverbs 6xov and 6kwc express respectively “where and how” 
and refer to what precedes in the list, that is “the patient, the operator, the 
assistants, the instruments, the light”.442 This aspect is also evident in a 
medical papyrus from Oxyrhynchus, P.Oxy. LXXIV 4972, dating back to 
the II-III cent. AD, which belongs to the catechistic genre and contains a 
systematic exposition of the divisions of surgery (http://litpap.info/ 
delp/119317). In ll. 3-7, the most correct positions for certain operations 
are discussed and, in particular, it is mentioned when the patient is lying 
back on a sloped couch with either the head (Gvéppomov) or the feet 
(katéppomov) raised higher: 


TO O€] OYNLATIKOV EOTLV TO TA[V | ExiTH]di@v CYNLATOV dozEp | Stav] AéyopEv 
avapotov (/. avappomov) i | Katdpjomov (/. Katdppomov) oynpaticw (U. 
oxnpatiCev) tov | KauVvJovta 


The position-based is that concerned with appropriate positions, as when we 
speak of positioning the patient tilted up or tilted down. 


(transl. LEITH 2009, 63) 


The synchronous ‘holistic’ and ‘atomic’ study of the adjective ka8édptoc in 
all its occurrences led to suspect some textual uncertainties in two passages 
from late antique compendiasts. The first one, transmitted by Oribasius, is 
part of an excerpt taken from the work entitled [epi zoiovpévev 
BonOnudtov by Antyllus, a surgeon and physician of the II century AD. 
The passage concerns the most proper couch for the patient. It is reported 
that a tilted couch, i.e. a couch with the head higher than the feet, puts the 
patient under strain because (s)he is like seated on a chair, even if this 
position is suitable for those who suffer from head pain. The text printed by 
the Teubner editor of Oribasius, J. Raeder, runs as follows (Coll. [IX 14,6, 1— 
3 = CMG VI 1,2, 15,21-3 R.): 


1 (sc. KAivn) 5’ &vappomos OMdSpa KOTOD TOMTUKH, Eotkvia KABEdSpiov OYHLATL, 
TOIs OE TEPi KEMOATV ODK AVEPLOOTOG. 


The manuscript tradition is unanimous in recording kaGedpiov oynpoatt,443 
but U.C. Bussemaker and Ch. Daremberg in their previous edition of the 
Collectiones Medicae (Paris 1854, p. 310,4) make the emendation 
Ka8edpi@ oynpati. In the former case, the term in genitive is a noun, i.e. 
the neuter ka0édptov, and is referred to the material object, a small chair.444 
But this juxtaposition is unparalleled, whereas the adjective Kka8édptoc + 
oyi|W0 is certainly attested, as aforementioned, and appears in dative also in 
another excerpt from the same work by Antyllus (cf. Coll. VI 23,9,4-10,1 = 
CMG VI 1,1, 181,23-4 Raeder: tév 5‘ drupétav (tovs ...... ) Kabedpio 
oYWaTl ai@pyntéov). Thus, despite the consensus of the manuscripts, the 
study of all the attestations of the term leads to approve, on a grounded 
basis, the correction made by Bussemaker and Daremberg. 

Also in a passage from Paul of Aegina, in a chapter on the insertion of 
the catheter in a male bladder, the occurrence of the term seems to conceal 
an element of philological uncertainty. The text edited by J.L. Heiberg (VI 
59,1,9 = CMG IX 2,98,10 H.) is tov 6& Kdpvovta oynpaticoavtEs sic 
Kxadédptov, literally “having placed the patient on a seat”. According to 
Heiberg’s apparatus, the only divergence in the manuscript tradition should 


be the presence of kat following ka0édpiov in two codices of the XIV 
century from Paris (D, cod. Paris. Gr. 2208, and F, cod. Paris. Gr. 2292). 
However, it might be relevant that the second hand of a codex of the X 
century, V (cod. Paris. Gr. suppl. 446), deleted sic. This deletion might 
reflect the common expression with xaQédpiog as an adjective (tov 6& 
KGLVOVTG CxNLATicavtEs KABESpltOV), whereas in the text transmitted by the 
manuscript tradition and printed by Heiberg ka@édpiov is a noun and is 
unparalleled in this formula. Therefore, in case the deletion made by V? is 
correct — as it seems to be given the occurrence of the term in similar 
expressions -—, the adjective «aQédpiog would acquire its common 
predicative value, even if it is usually placed before 6 kéuvav / adoyov, 
and not after, as it would be in this passage. 

Even more significant from the textual viewpoint are the 
aforementioned medical papyri attesting the word. In the earliest papyrus, 
P.Aberd. 11,445 the term appears in the discussion on pterygium surgery 
immediately before the lacuna at |. 10 (cf. Il. 9-13): 


yeip[ovpysia Tod atepvysiov.] | peta TOV KabESpElo[vV SvtA TOV TéOYOVTH, EK] | 
Tod dpOaApod dSig[vt] BAgpapa diaoteiAavtes] | TO atTEpbystov Si[eK~avodLEV 
ay«i]lotpsio1, BeAOvny [5é Aivov Kai Tpixa inmsiav] 


Surgery of pterygium. After the patient is seated, having separated the two 
eyelids of the eye, we will isolate the pterygium from the eye with a hook, and a 
needle having a thread and a horsehair... 


The restoration é6vta tov miéoxovta, with the participle dependent on peta, 
“after” in temporal sense, is a likely supplement, but also kapvovta may be 
considered.446 On the one hand, the verb sipi is unparalleled in this 
expression, on the other hand the space is not enough for the usual 
oynuatiCovta, and this makes the presence of évta plausible. Moreover, the 
article before the adjective seems to suggest an attributive value, whereas 
in the medical authors, as already seen, ka0édptioc is always in predicative 
position. Different is the restoration proposed by TURNER 1939, 13: peta 
Tov KaOEdpEto[V Biov ?]. It was very probably influenced by a passage by 
Soranus (Gyn. I 27,3,4 = CMG IV 18,2 Ilberg: ka8édptov didyew Biov), the 
only author who uses the adjective xa8édpiog with the meaning of 
“sedentary” (LSJ° 851 s.v. ka0édptoc 2), but it absolutely makes no sense 
in this context. This last case strongly illustrate how a good lexical 
awareness of the semantic values of a word in all its occurrences and kinds 
of sources can contribute to a better philological understanding of the text 
itself. 


Philologically challenging and stimulating are the Il. 66—76 of P.Ryl. III 
529r (col. 11): 


oi] | pév GAAot KaBE[Sprov,, ]looav tov Kau[vovta] oyxn||LatiCetw. Husi[s dé... ] | 
xekAévol, J‘ v " 10 ’AA[séév]|Spiov oxfiud gotw Sv[oad]lyéotatov, tO 56é 
KskA[tlé]|vov doaréotepov. a[oga]|Agotepov 4é OTE LE[v Batt]lov oynuatiCetw 
tov [mG0]|yovta, OT 5é TPHVi). 


The other physicians put the patient in the sitting position, whereas we lay him/ 
her down: the “Alexandrian position” is extremely painful, while the lying 
down position is safer. It is safer to place the patient sometimes on his/her back, 
sometimes on his/her stomach. 


The Rylands papyrus was first published by ROBERTS 1938, 158-62 
(ed.pr.), then republished by MARGANNE 1998 110-47 (ed.alt.) with some 
textual differences, as in the case of |. 67. Here the adjective caQédptocg was 
very plausibly restored by MARGANNE 1998, 112 and 117, whereas the 
editor princeps (ROBERTS 1938, 160) simply printed katg[. The term is 
mentioned in the earliest extant discussion on the best and less painful 
position in which to place the patient before performing the reduction of 
his/her dislocated shoulder. Indeed, in Greek medical sources the sitting 
position, the ka0édpiov ofa, is sometimes explicitly rejected or preferred 
to other positions, especially the supine one. 

A first observation on this passage: given the end woav in |. 68 and the 
context, one could expect the first aorist active indicative of a verb of 
thinking/saying (e.g. “think”, “claim”, “‘advise”) in the lacuna of |. 67, 
followed by the infinitive oynpatigew in Il. 68-9. Considering the previous 
presence of ka0édpiov and what seems to be the space left at the end of the 
lacuna, which is measured with three dots in Marganne’s edition, a 
plausible supplement might be perhaps n&{]|woav from d&16m, with the 
meaning “to think”, i.e. GAAot Ka0E[Spiov HE{]|Moav TOV Kap[vovta] oxT|| 
paticetv. Interestingly, ROBERTS 1938, 16, though without any supplement 
to the text, translates: “other authorities (advise) that the patient be 
disposed in a ... position”. 
| 

Moreover, at the end of |. 69 before the break, possible supplements 
might be: fusi[s 58 pGAAOV] | KexAmévol, J‘ v ’ or Husif[c 5é Kata]| 
kexAmévol, Tv’, cf. Aét. XV 5,50-1 (19,15—-6 Kostomiris) oynuatiléo8@ 
O NAOYWOV KATAKEKALLEVOG. 

A more puzzling textual issue concerns I]. 70—1. The context (Il. 66-81) 
is the description of a method of reduction opposed to the procedures of 


other physicians or schools and it is specified that others place the patient 
seated; on the contrary, the author recommends lying him down. According 
to the restoration made in Il. 70-1 by both Roberts and Marganne, i.e. 
’Ad[eGav]|Spiov, the so called ’AAséavdpiov oyijwa is defined as 
dvoadyéotatov, whereas the lying-down position is regarded as less painful 
and safer. Marganne (p. 129) considers the ’AAgéavdptov oxf\yo as the 
same as the sitting position mentioned in |. 67, and not as a different 
posture. In such a case, the second statement (10 ’AA[Eéav]|Sptov oxr|La 
got dv[oaA]|yéotatov, TO 5& KeKA[I-LE]|vov Go@aAgoTepov) would seem 
to explain the reason for the author’s preference for the lying-down 
position, as expressed in the previous lines (oi] | Hév GAo1 KaPé-[dptov,_ ]| 
wMoav Tov KOI[vovta] oxN||LWaTiCew. Husi[c dé... ] | KekAwévol Tv’). 

The point is that there is no attestation of the ’AAsEdvdptov oxo in 
medical literature, and Withington’s suggestion447 that the ’AAséavdpiov 
oyT|Wa may correspond to the position on the Thessalian straightbacked 
chair, the péya 0c OecoaArkov, used for dislocations and mentioned by 
Hippocrates (Art. 7,36—43 = IV 92,10-94,1 L., cf. also Gal. In Hp. Art. 1 22 
= XVIla 344,1-345,8 K.), seems unmotivated.448 

The reading ’AA[séav]|5ptov, accepted by MARGANNE 1998, 112, was 
first restored by ROBERTS 1938, 160, who did not understand the presence 
of xa8é[dprov in |. 67, as aforesaid. But a collation with the digital image of 
the papyrus449 has raised difficulties with the supplement ’AA[eéav]|5ptov, 
inasmuch the traces are incompatible with the usual shape of a and A. At 
the break, indeed, part of a small horizontal trace survives, consistent with 
the lower trait of an a. It is preceded by what seems to belong to a vertical 
trait slightly sloping to right. The shape of these traces can be convincingly 
compared with the sequence ya recurring several times in the papyrus.450 
Accordingly, also taking into account the ending dptov in 1. 71, a likely 
supplement that seems paleographically compatible and fits well the space 
is TO ya[p Ka0E]|Spiov. Additionally, on the syntactic side, the presence of 
ya[p would acquire an explanatory function perfectly appropriate to the 
content, thus introducing why the author — that in the previous sentence 
affirms his preference for the lying-down position — rejects the Kka8édptov 
oyi|Wa, being it dvoaAyéotatov, “more painful”. This sense of opposition is 
then marked and strengthened by the adversative dé (1. 72), where it is 
stated that the lying down position is safer. 

To conclude on this aspect, the example of Ka0édptoc is useful to 
highlight the potential philological contribution of the vertically in-depth 
and both ‘atomic’ and ‘holistic’ lexical studies carried out for Medicalia 
Online. Thus, given the high degree of textual awareness offered by this 


kind of lexical studies, the present specimen illustrates how the medical 
text can be improved in terms of new and more reliable (or at least 
plausible) readings and emendations, as well as it can be deeper and better 
understood: 


oi] | pév GAAOL KaBE[Sptov Ei]|ooav tov Ké[vovta] oxN||LatTiCew. HuEi[s 5é 
LaAAov] | Kexdpévol I'v ': 10 yap KaGé]|Spiov oxfiud gotw dv[oaA]|yéotatov, 
TO O€ KEKA[ILE]|VOV GoMaAéoTEpov. 


The other physicians think that it is better to put the patient in the sitting 
position, but we rather that it is better to lay him/her down: the sitting position is 
indeed extremely painful, whereas the lying down position is safer. 


3Conclusions 


The experience of a specialized lexicographical tool like Medicalia Online 
demonstrates the different kinds and levels of usefulness and the potential 
of a digital lexicographical database characterized by a wide-ranging 
approach in the era of Digital Humanities. Among these, the usefulness of a 
tool specifically conceived to carry out in-depth, ‘vertical’ lexical studies 
aimed at providing the broadest possible overview of the examined items 
under multiple perspectives, namely linguistic, archaeological, historical- 
scientific. The usefulness concealed in the inner nature of a digital 
instrument, that is its dynamism, its flexibility, its capability of allowing the 
author of the entry to constantly update the contents. The usefulness of the 
interdisciplinary, comprehensive approach to the Ancient World, which 
makes Medicalia Online a helpful resource for contributing to improve and 
reconsider the studies in medical papyrology as well as in the other subject 
areas involved, and broadens the circle of users to any scholar or enthusiast 
of those research fields. Furthermore, the user-friendliness of the entries in 
spite of the wide range of information and, most importantly, the mutual 
enrichment resulted from the interconnection between the MedOn lemmas 
and the digital editions of the medical texts on papyrus in the core database. 
This strongly encourages an enhanced textual understanding stimulating a 
double awareness — a concrete one and a textual-philological one —, as the 
selected specimina have illustrated. The outcomes are, on the one hand, the 
attempt to reconstruct the physical reality hidden under the words and, on 
the other hand, a better and deeper exegesis of the ancient medical texts 
through a better and deeper knowledge of the words themselves. 

Such a ‘vertical’ lexicographical approach also helps elucidating the 


degree of technicality of the medical terms, as well as of the texts 
containing them. Two of the presented case studies can be taken as 
illuminating examples of this aspect. kaédpt-og (§ 2.3) has a marked 
technical value, since it occurs, with its specialized medical meaning, only 
in strictly medical sources; conversely, the compound gappaKoOjKny (§ 
2.2), characterized by a pronounced morphological transparency, did not 
developed into a genuine technical term and was probably employed as a 
colloquial noun for an object of medical use, the physician’s tool-case or its 
household counterpart, so that it was not a medical terminus technicus but 
just a word with a medical meaning. And indeed it does not appear in a 
stricto sensu medical papyrus, but in a private letter carrying medical 
information, P.Oslo II 54. Horeion, the author of the letter, who lives away 
from home for personal or professional reasons, asks his father Apollonios 
to send him the portable medicine-chest (ll. 5-6 zéuwov | por tiv 
PaApLaKoOnKyy), as well as two remedies with different properties, that is a 
biting pharmakon and a much milder one (Il. 7-9 aithoag zlap]a tod 
iatpod | PdppLaKov dakvnpov| Kai Etepov NovUtEpov). The terms employed 
by Horeion point to different degrees of technicality and particularly 
significant is the juxtaposition between the @dppakov dakvynpov and the 
ndvtepov one, likely representing two kollyria of contrasting virtues, 
perhaps belonging to the typology of the acharista.451 These terms, indeed, 
especially the former, are technical adjectives applied to eyesalves and 
ophthalmic preparations. Thus, whilst the vocabulary employed by the 
writer reflects different levels of technicality, the choice of accurate and 
rare words reveals his proficiency in the medical field and a certain medical 
literacy.452 This might mean that Horeion was either a literate layman 
interested in medicine, or even a person with healing skills, such as a 
would-be physician or a pharmakopoles. The case of P.Oslo II 54 is useful 
to demonstrate how a thorough lexical study of medical papyri from Greco- 
Roman Egypt contributes to shed new light on the socio-linguistic context 
in which these papyri were inserted, and may provide invaluable glimpses 
of the individuals ‘behind their words’. 

Finally, all these factors restate again the usefulness of Medicalia 
Online and its methodology to plumb the depths of the past, allowing for 
the revaluation of the entire material come to the present and promoting a 
more integrated knowledge of the Ancient World, to which a voice — if not 
‘living’ at least not ‘dead’, or irremediably lost —can be returned: the echo 
of its written and material soul. 
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intertextuality 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 
interventions (editorial, scribal) 1, 2, 3, 4, 5, 6, 7, 8, 9 
iotacism, itacism 1, 2, 3 

IRCyr (Inscriptions of Roman Cyrenaica) 1 


Jordens, A. 1 
Juxta 1 


Kaibel, G. 1 

Kallimachos Project 1, 2 
Kapsomenakis, S.G. 1 
Kenyon, F.G. 1 

Koine Greek 1, 2, 3, 4, 5, 6, 7 


lacunas 1, 2, 3, 4, 5, 6, 7, 8,9 

layout 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 

LDAB (Leuven Database of Ancient Books) 1, 2, 3, 4 
lectional signs 1, 2, 3 

Leiden conventions 1, 2, 3, 4 

Leident 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 
Leith, D. 1, 2, 3 

lemmas 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 
lemmatization 1, 2, 3, 4, 5, 6, 7 

lenis (spiritus) 1 

lexical issues, lexicography v, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 
lexicon 1, 2, 3 

line-by-line commentary 1, 2, 3 

literacy 1, 2, 3, 4, 5,6 

literary papyri v, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22; 23, 24, 25, 26, 27 

Litinas, N. 1 

Litpap.info 1, 2, 3,4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 
LOFTS (Leipzig Open Fragmentary Text Series) 1, 2 
Lougovaya, J. 1 

Lucullus online 1 

Luiselli, R. 1 

Luppe, W. 1 

LXX Septuaginta 1 


magical papyri 1 

Malaspina, E. 1 

Magnani, M. v, 1, 2, 3, 4 

Mahaffy, J.P. 1 

MALP (Morphologically Annotated and Lemmatized Papyri) 1 
Mangerud, J. 1 

manuals 1, 2, 3, 4, 5 

Maravela, A. 1, 2, 3 

Marganne, M.-H. 1, 2, 3 

margins 1, 2, 3, 4, 5, 6, 7 

marginalia 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 

marks (ancient) 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 

markup 1, 2, 3, 4,5 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26 

Martini, G. 1 

Mastronarde, D. 1, 2, 3 

MATE tagger 1, 2 

Materiale Textkulturen 1 

materiality 1, 2 


,6, 7, 


Mayser, E. 1 

McGann, J. 1 

McLeod, D. 1 

medical papyri vi, 1, ie 3,4; 5, 6; 7.8, 9, 10, 117, 12, 13 
Medicalia Online v, 1, 2, 3, 4, 5, 6, 7, 8 
medicine 1, 2, 3, ie , 6,7, 8,9 
Mertens-Pack1 2 

metadata v, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 
meta-papyri v, 1 

metatextuality 1, 2, 3, 4 

Michigan Medical Codex 1, 2 
micro-language 1, 2, 3 

middot 1 

mid-punct 1 

Mihalyko, A.T. 1 

milestones 1, 2 

Mirizio, G. 1 

mirror 1 

mistakes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 
Mnemon 1 

Monella, P. 1 

monograms 1 

Mons Claudianus 1, 2, 3, 4 

Morpheus 1, 2, 3, 4,5 

Muellner, L. 1 

multitextuality 1,2, 3, 4,5 


Nagy, G. 1,2 

nested tags 1, 2, 3, 4 
Nicander 1, 2, 3, 4 
nominative 1, 2 
normalization 1, 2, 3, 4 
notebooks 1, 2 

notes 1, 2, 3 

numbers 1, 2 

Nutton, V. 1 


obelismene (diple) 1, 2 

OCR 1 

Odyssey 1, 2,3 

OGL (Open Greek and Latin Project) 1 
Olson, S.D. 1 

ontology model 1, 2, 3, 4, 5,6 

open access 1, 2 
Orestes 1 

Oribasius 1, 2, 3, 4, 5,6 
orthography 1, 2, 3,4, 5 
ostraka 1, 2, 3, 4, 5, 6, 7, 8, 9, 
Ott, W. 1 

outdent 1 


oy 10) A, 12 
0, 11, 12, 13, 14, 15 


@ 
i, 
— 00 


Oxygen 1 


palaeography v, 1, 2 

Palamedes 1 

palimpsest 1, 2 

PapPal (Papyrology-Palaeography) 1 

Papyri.info v, 1, 2, 3, 4,5, 6, 7, 8-9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 
Papyrological Editor 1, 2, 3, 4 

Papyrological Navigator 1, 2, 3, 4, 5, 6, 7, 8, 9 

paradosis 1, 2, 3, 4 
paragraphos 1, 2, 3, 4 
paraliterary papyri 1, 2, 3, 4 

parallels 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 
paratextuality 1,2, 3, 4, 5,6 

parchment 1, 2, 3, 4 

parentheses 1 

Parma 1, 2, 3, 4 

parsers, parsing 1, 2, 3,4, 5 

part-of-speech (POS) tagging, postag 1, 2, 3, 4 
Paul of Aegina 1, 2, 3, 4, 5, 6 

Perseids 1 

Perseus Digital Library 1, 2, 3 
Perseus-Under-Philologic 1 

Petenephotes 1, 2 

phenomenology 1, 2, 3, 4, 5, 6 

phonetics, phonology 1, 2, 3, 4, 5, 6, 7, 8, 9 
phraseology 1, 2 

Pierazzo, E. 1 

Plato 1 

PML Tree Query Engine 1 

prescriptions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 

PROIEL 1 

Pseudo-Galen 1 

Pseudo-Soranus 1 

Pseudo-Hippocratic writings 1 

pterygium 1 

Ptolemaic texts 1, 2 


, 3, 


quantities 1,2, 3 
questionnaires 1, 2, 3, 4, 5, 6, 7 
quotations 1, 2, 3, 4, 5, 6, 7 


Raeder, J. 1 

raised letters 1, 2 

receptaria 1, 2 

recipes 1, 2, 3, 4, 5, 6, 7, 8, 9- 10, 11, 12 
recto 1, 2,3 

Reggiani, N. 1, 2, 3, 4, 5, 6, 7 
regular expressions (regex) 1 
regularizations 1, 2, 3, 4, 5,6, 7, 8,9 
restorations 1, 2, 3 

Riafio Rufilanchas, D. 1, 2, 3 
Rinoldi, P. 1 

Roberts, C.H. 1 

Robinson, P. 1 

Roman texts 1, 2, 3, 4 


SADiLaR (South African Centre for Digital Language Resources) 1 
Sahle, P. 1 

Sammelbuch 1, 2, 3 

Sappho 1 

Schmidt, D. 1, 2 

Schmoll, H. 1 

scholia 1, 2, 3, 4, 5, 6, 7 

Scholia Euripidea 1, 2 

school texts 1, 2 

Schubert, P. 1 

Schwartz, E. 1, 2 

search tools, searching 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23 

searchability 1, 2, 3, 4, 5, 6, 7 

semantics 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 
Sematia vi, 1, 2, 3, 4, 5, 

semicolon 1 

separators 1 

SETS Treebank Search 1 

signs 1, 2, 3, 4, 5, 6, 7, 8,9 

sinusoid 1 

slanting stroke 1 

slashes 1, 2 

Soranus 1 

Sosin, J.D. 1 

SOSOL (Son of Suda Online) 1, 2, 3, 4 

spelling 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 
spirits 1, 2, 3 

standard 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 
standardization 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 

stand-off annotation 1, 2, 3, 4, 5 

stemma, stemmatic model 1, 2, 3, 4, 5, 6, 7 

stigmai 1 

Stolk, J.V. v, 1, 2, 3, 4, 5, 6, 7 


Stoop, J. 1 

strokes 1, 2 

sub-literary papyri 1, 2, 3 
subscript 1 

substandard 1, 2 
superscript 1, 2, 3 
supraline 1 

supralinear writing 1, 2 
surplus 1 

symbols 1, 2, 3,4, 5 
Synallagma 1 

Synopsis Project 1, 2 
syntax 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 
Syriac texts 1 


tablets 1, 2,3, 4,5 

tags, tagging 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 
teaching 1, 2, 3 

TEI (Text Encoding Initiative) 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17 

Tematres 1 

textuality 1, 2,3, 4,5 

Theognis 1 

Theriaka 1, 2, 3, 4 

THV (Thesaurus Herculanensium Voluminum) 1, 2, 3 

tituli picti 1 

TLG (Thesaurus Linguae Graecae) 1, 2, 3 

TM numbers 1, 2, 3 

TM Text Irregularities v, 1, 2, 3, 4 

tokens, tokenization 1, 2, 3, 4 

Tomasi, F. 1 

transcriptions 1, 2, 3, 4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 
Transkribus 1 

translations 1, 2, 3, 4, 5, 6, 7, 8,9 

transtextuality 1, 2, 3, 4 

treebanks, treebanking v, 1, 2, 3, 4— 5, 6, 7, 8, 9 

Trismegistos 1, 2, 3, 4, 5,6 

Trismegistos People 1, 2 

Turner, E.G. 1 

TUSTEP 1 


UDPipe tagger 1 

underdot 1, 2 

underline 1 

update 1, 2, 3, 4, 5, 6, 7, 8 

URI (Uniform Resource Identifier) 1 
URL (Uniform Resource Locator) 1, 2 
URN (Uniform Resource Name) 1, 2, 3 
user-friendliness 1, 2, 3, 4 


vacat 1 

Valerius 1 

van Groningen, B.A. 1 

Van Minnen, P. 1 

variants 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 
variation vi, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 
verso 1, 2, 3, 4 

vertical stroke 1, 2 

vestiges 1 

Viereck, P. 1 

Vierros, M. vi, 1, 2, 3, 4, 5, 6, 7 

Vigilanti, R. 1 

Vindolanda 1, 2 

virtuality 1, 2, 3,4, 5 

vocabulary 1, 2, 3, 4, 5, 6, 7, 8 


West, M.L. 1 

Wilcken, U. 1 

Withington, E.T. 1 

writing 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 
Wurzburg 1, 2, 3, 4, 5, 6, 7 


XML 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 
XPath 1 

XPointer 1 

XQuery 1, 2, 3 


Yiftach, U. 1 
Youtie, H.C. 1, 2, 3, 4,5 
Yuen-Collingridge, R. 1, 2, 3 


Zaytseva, K. 1 


ll. Papyrus texts 


The papyri are cited according to the official Checklist of Editions (http:// 
papyri.info/docs/checklist), to which add GMP = Greek Medical Papyri I-Il, 
ed. I. Andorlini, Firenze 2001—2009. The list has been compiled with the 
precious help of Alessia Bovo, for which | am most grateful. 


BGU | 1 

BGU 12 1 

BGU | 243,15 1 
BGU | 326 1 

BGU II 451 1 

BGU III 994 1 

BGU XIll 2332,20 1 
BGU XIll 2359,10 1 


BGU XVI 2651,6 1 
BGU XVII 2682,6, 9-10, 10 1 
Chr.M. 172,i,15 1 
Chr.M. 216 1 

Chr.M. 217,9 1 
Chr.M. 316 1 

CPR IV 198,16 1 
CPR XXII 2,1, 5, 9, 11 1 
CPR XXIV 27,17 1 
CPR XXV 25,4 1 
CPR XXX 28 1 
DDbDP 2013 1 14 
DDbDP 2015 1 1, 2, 3 
DDbDP 2015 2 1 
DDbDP 2015 3 1 
DDbDP 2016 1 1 
DDbDP 2016 2 1 
FIRA2 Ill 50 1 

GMP 1 1,i 1, 2 

GMP | 1,i,4-5, 11-12 1 
GMP 16 1 

GMP II 10 1, 2 

GMP II 10,6—7 1 
Jur.Pap. 25 1 
Jur.Pap. 36 1 

MPER n.s. XIII 6 1 
MPER n.s. XIII 9,1 1 
O.Berenike II 237 1 
O.Claud. | 1 
O.Claud. | 131 1 
O.Claud. | 135 1 
O.Claud. | 139 1 
O.Claud. | 148 1 
O.Claud. | 155,3-5 1 
O.Claud. Il 227 1 
O.Claud. Il 236 14 
O.Claud. Il 243 14 
O.Claud. Il 245 14 
O.Claud. II 245,2—7 1 
O.Claud. Il 246 14 
O.Claud. Il 248 14 
O.Claud. Il 276 1 
O.Claud. Il 363 1 
O.Claud. Il 383 14 
O.Claud. IV 723 1, 2 
O.Claud. IV 724 1 
O.Claud. IV 798,6 1 
O.Claud. IV 839 1 
O.Claud. IV 840 1 


O.Did. 37 1 

O.Did. 417 1 

O.Edfou II 318,7 1 
O.Narm. 3 1 

P.Aberd. 11 1, 2 
P.Aberd. 11,10 1 
P.Aberd. 124 1 

P.Amh. II 154,2, 8 1 
P.Ant. | 28 1 

P.Ant. Ill 126 1 

P.Ant. Ill 126 (a) 1 
P.Ant. Ill 127 1 

P.Ant. Ill 127, Ib, 6 1 
P.Ant. Ill 184 1 

P.Ant. Ill 186 1 

P.Ant. Ill 214 1 
P.Berl.Moller 13 1 
P.Berl.Moller 13,i 1 
P.Berl.Moller 13,ii,1-3, 4-8, 9-15 1 
P.Berol. inv. 6845 1, 2 
P.Berol. inv. 10570 1 
P.Berol. inv. 11911 1 
P.Berol. inv. 12310 1 
P.Berol. inv. 17038 1 
P.Berol. inv. 17048 1 
P.Berol. inv. 21155 1 
P.Bodl. | 17,i,9 1, 2 
P.Bodl. | 47,12, 20, 26 1 
P.Cair.Masp. | 67003 1 
P.Cair.Masp. | 67005 1 
P.Cair.Masp. | 67006 1 
P.Cair.Masp. | 67007 1 
P.Cair.Masp. | 67008 1 
P.Cair.Masp. inv. 67172 1 
P.Cair.Masp. inv. 67173 1 
P.Cair.Masp. inv. 67174 1 
P.Col. IV 122 1 

P.Col. X 278,4 1 

P.Col. X 280,13 1 
P.Coll.Youtie | 4v 1 
P.Corn. 34 1 

P.Diog. 20,6 1 

P.Eirene Ill 25, 3 1 
P.Fam. Tebt. 15,iii,79 1 
P.Fam.Tebt. 29,44 1 
P.Fay. 204,9 1 
P.Fuad.Univ. 1 1, 2 
P.Gen. | 144,23 1 
P.Gen. inv. 111 1 


P.Gen. inv. 111 1 

P.Gen. inv. 111 1 

P.Gen. inv. 111 1 

P.Gen. inv. 111 1 

P.Gen. IV 192,10-11 1 
P.Got. 1 

P.Got. 54 descr. 1 

P.Grenf. II 1 

P.Grenf. Il 15 1 

P.Hamb. | 16,22 1 
P.Hamb. IV 244,12 1 
P.Haun. II 22,5 1, 2 
P.Herc. 229 + 242 + 243 + 247 1 + 248 + 433 + 437 + 1077 + 1088 + 1098 + 
1428 + 1609 + 1610 + 1648 + 1788 
P.Herc. 408 + 409 + 1117 + 1573 1,2 + 1672 
P.Herc. 444 + 460 + 466 + 1073 1 + 1074/1081 + 1081 
P.Jena Il 8,7 1 

P.Johnson 1 

P.Koln V 206r 1, 2, 3 
P.Lond. | 1 

P.Lond. III 774 1 

P.Lond. III 775 1 

P.Lond. III 776 1 

P.Lond. III 777 1 

P.Lond. III 778 1 

P.Lond. Ill 1324,7 1 
P.Lond.Lit. 166 1, 2 
P.Lond.Lit. 166, iv,6 1 
P.Lond.Lit. 28 1 

P.Mert. | 12 1 

P.Mert. | 12,13— 24 1 
P.Mich. inv. 6972 1 

P.Mich. XVII 758 1 

P.Mich. XVII 758 H 1, 2 
P.Mich. XVII 758 H r 1 
P.Mich. XVII 758 Hr 11 1 
P.Mich. XVII 758 H v 2, 3 1 
P.Naqlun II 22 1 

P.Oslo II 54 1 

P.Oslo Il 54,6 1 

P.Oslo Ill 72,9 1 

P.Oslo inv. 1576 1, 2 
P.Oslo inv. 1576,5 1 
P.Oxford Sackler s.n. 1 
P.Oxy. 11, 2 

P.Oxy. | 40 1 

P.Oxy. | 135 1 

P.Oxy. | 136 1 

P.Oxy. | 137 1 


P.Oxy. | 138 1 

P.Oxy. | 144,4 1 

P.Oxy. | 155,4 1 

P.Oxy. I] 234 + LIl 3654,92 1 
P.Oxy. VI 963 1 

P.Oxy. VIII 1088 1 

P.Oxy. IX 1184 1, 2 

P.Oxy. IX 1184v 14 

P.Oxy. XIX 2221r 1, 2, 3 
P.Oxy. XIX 2238,18, 21 1 
P.Oxy. XXXIII 2665,17, 19 1 
P.Oxy. XXXIV 2724,20 1 
P.Oxy. XLII] 3117 1 

P.Oxy. XLII] 3117,13-14 1 
P.Oxy. LIX 4001 1 

P.Oxy. LIX 4001 ,30-1 1 
P.Oxy. LVI 3851 1 

P.Oxy. LXI 4131,16, 39 1 
P.Oxy. LXIV 4441,x,27 1 
P.Oxy. LXX 4790,16, 19, 30 1 
P. Oxy. LXXIV 4972 1 

P.Oxy. LXXIV 4973 1 

P.Oxy. LXXX 5239 1, 2, 3 
P.Oxy. LXXX 5239,15 1 
P.Oxy. XVI 1898,19, 38 1 
P.Petr. 112 1 

P.Petra | 4,5 1 

P.Rein. II 20 1 

P.Ross.Georg. | 20 1 
P.Ross.Georg. V 34,2 1 
P.Ryl. Ill 478 1 

P.Ryl. Ill 529 1, 2 

P.Ryl. Ill 529r 1 

P.Ryl. Ill 529r,57 1 

P.Sorb. inv. 2302 1 

P.Sorb. inv. 2303 1 

P.Stras. VIII 772 1 

P.Stras. VIII 772,6, 9, 15, 21 1 
P.Strasb. inv. 849 1 

P.Strasb. inv. 1187 1, 2, 3 
P.Strasb. inv. 1187, A, i,11 1, 2 
P.Strasb. inv. 1187, A, i,14 1, 2 
P.Strasb. inv. 1654 1 
P.Strasb. Copte inv. 563,6-—7 1 
P.Tebt. Il 272v 1 

P.Tebt. Il 272v,4—5 1 

P.Tebt. Il 318,23 1 

P.Tebt. Il 679 1 

P.Tebt. Ill.1 771 1 


P.Tebt. Tait 39 1 
P.Tebt. Tait 40 1 
P.Tebt. Tait 41 1 
P.Turner 14 1 


« P.Yale Il 134 1 

* PSI inv. 3783 1 

* PSI inv. 22011,48 1 

* PSI IV 428,89-90, 92 1 
¢ PSI V 454,19 1 

* PSI VI 718 1 

* PSI X 1180a iii,12 1 

* PSI XV 1510 1 

* SB 14668, 4 1 

* SB VI 9625,23 1 

* SB X 10559,1 1 

* SB XIV 1629 1 

* SB XVIII 13732,13 1 

* SB XVIII 13947,15 1 

* SB XX 14990,15 1 

* SB XXIV 15915,6 1 

* SB XXIV 16290 1 

* SB XXIV 16290,2 1 

* SB XXIV 16291,4 1 

* SB XXVI 16458 1 

* SB XXVI 16458,10-13 1 
* SB XXVI 16796,10-11, 16 1 
* Sel.Pap. | 85 1 

* Stud.Pal. Ill 314,1 1 

* Stud.Pal. VIII 791,1 1 

* Stud.Pal. VIII 875,2 1 

* UPZ159 1 

* UPZ17 1 


lll. Literary sources 


« Aesch. Th. 454-5 1 

* Aet. VI 50,75-9 1 

* Aet. XV 5,50-2 1 

* Aet. XV 5,50-1 1 

¢ Ammon. Diff. 390,2 1 

* Cat.Cod.Astr. | 104,26—30 1 
* Dsc. MM Ill 147 1 

« Etym.Magn. 520,26-7 1 

¢ Eur. Or. 1-25, 401-25 1 

* Gal. Antid. 15 1 

* Gal. Comp.med.gen. 1 

* Gal. Comp.med.gen. | 3 1 

* Gal. Comp.med.gen. | 4 1 

* Gal. Comp.med.loc. 11 1 

* Gal. Comp.med.loc. | 2 1, 2 


Gal. 
Gal. 
Gal. 
Gal. 
Gal. 
Gal. 
Gal. 
Gal. 


Comp.med.loc. || 3 1 
In Hp. Art. 122 1 

In Hp. Epid. || 4 1 

In Hp. Epid. || 8 1 

In Hp. Off. 16 1 

In Hp. Off. II 22 1 
Indol. 33-5 1 

Libr.pr. || 91-3 1 


Heliod. Cheirurg. 1 
Herod.Med. ap. Orib. 1 


Hom. 
Hom. 
Hom. 
Hom. 
Hom. 
Hom. 
Hom. 


H.15 1 

I. VI 287-8 1 

I. VII 433-47 14 
I. VII 526 1 
WX 4 

I. X 101 

I, XVIL578 1 


Hp. Aph. 1 

Hp. Art. 7,36—43 1 
Hp. Epid. |ll 16 14 
Hp. Fract. 37 1 
Hp. Off. 2 1 

Nic. Ther. 333-4 1 
Nic. Ther. 387 1 
Nic. Ther. 394 1 


Orib. 
Orib. 
Orib. 
Orib. 
Orib. 
Orib. 
Orib. 
Orib. 
Orib. 


Coll.med. 1 
Coll.med. \V 615,19 1 
Coll.med. \V 620 1 
Coll.med. V 30,6—7 1 


Coll.med. V| 23,9,4—10,1 1 


Coll.med. IX 14,6,1-3 1 


Coll.med. XLVI 11,2,1-3,3 1 


Syn.Eust. lll 28,6, 9 1 
Syn.Eust. Vil 45 1 


Paul.Aeg. III 2,2,4-6 1 
Paul.Aeg. III 79,1 1 
Paul.Aeg. VI 8,1,10—11 1 
Paul.Aeg. V1 59,1,9 1 
Paul.Aeg. VI 90,4,10-11 1 
Paul.Aeg. VI 99,2,1-6 1 
Paul.Aeg. VI 101,1,3-6 1 


Paul. 


Aeg. VII 17,31 1 


Philod. Mus. IV 1 
Philod. Piet. 1 
Philod. Poem. 1 1 
Philod. Rhet. 2 1, 2 


Plat. 
Poll. 


Meno 95e 1 
On. IV 203 1 


Procl. Or. 18 1 


¢ Ps.Gal. Def.med. 1 

¢ Ps.Gal. Def.med. 424 1 

¢ Ps.Gal. Introd. X\| 11 1 

« Ps.Hp. Ep. 3, 6, 6a 1 

¢ Ps.Hp. Ep. 4, 4a, 5 1 

* Ps.Hp. Vet.med. 2,17-8 1 
* Ps.Sor. Quaest.med. 1 

¢ Sch. Aesch. Th. 454h 1 

¢ Sch. Eur. Or. 1-500 1 

¢ Sch. Eur. Or. 1 1 

* Sor. Gyn. | 27,3,4 1 

* Sor. Gyn. Il 37,5,1-4 1 

* Steph. /n Hp. Progn. || 1 1 
* Suda 6 896,1-3 1 

* Thgn. 434-8 1 

¢ Zonar. Lex. 6 524,27 1 


IV. Greek concordance 


N.B. Single Greek letters are not indexed. 


* ayKdAais 1 

* QYKIOTpEIO! 1 
* GdedAge 1 

* GdEAQGc 1 

« A®avadoiocg 1 
* Adup 1 

* A®Uc 1 

* la 1 

* aiAaBa 1 

* aitfhoas 1 

* aitiai 1 

* aiwpntéov 1 

« AAdgEdvopiov 1 
* GAha 1 

* GAAoI 1 

* QueTakivntov 1 
* GUTTEAIKOV 1 

* augioBaiva 1 
* augiogaiva 1 
* -av 1 

* avti 1 

* avdAoya 1 

* AvdpyOOTOS 1 
* avdpotiov 1 

* avdppotrov 1 
* avdpportroc 1 


asIow 1 
aoAiccay 1 
aTrodd0w 1 
aTrodwow 1 
atrAf 1 

arré 1 

aTrddo0¢g 1 
atrokpeivacdal 1 
aTTupEeTWwV 1 
Gpakos 1 
dpue_eva 1 
Aptranaios 1 
Gptous 1 
Gptwv 1 
aob_evéwv 1 
daotracduEba 1 
daoTradoyai 1 
dotrdoac6Gal 1 
dotrdoeo8ai 1 
aogahéotepov 1 
auTa 1 

auto 1 

auto 1, 2 
auTw 1 
G@pwvos 1 


Beddvny 1 

BIBAEia 1 
BIBAEidiov 1 
BIBAiSIov 1 
BIBAIOPUAGK(E)IOV 1 
BIBAIOMUAGKEIOV 1 
BIBAIoguAakKEiov 1 
BIBAIOMUAGKIOV 1 
Biov 1 

BAEgapa 1 
Bon®nyudtwv 1 
Bopeirai 1 

Boteitai 1 


ydaAaktoc 1 
yap 1 
yeiveo@ai 1 
yeivoyal 1, 2 
YEIVWOKI 1 
VEIVWOKIV 1 
yeveo@ai 1 
yiyveo8ai 1 
yiyvetai 1 
yiyvouai 1 


YIYVWHEVO! 1 
YIVVWOKEIV 1 
YIYVWOKW 1 
yivoya 1, 2 
yivouEva 1 
YIVWOKEIV 1 
YIVWOKW) 1 
yvovai 1 
ypageioa 1 
yuvaikdc 1 
Satta 1 
dakvnpov 1 

6€ 1,2 

oei 1 

deATApiov 1 
diekpavoGyEv 1 
oidayeliv 1 
diaoteiAavtes 1 
digdpiov 1 
dipuh} 1 

56En 1 

opov 1 
SuoaAyéotatov 1 


eav 1,2 
EyKAivov 1 
EYXEIPIOIOV 1 
€d0¢ 1 

€dpa 1, 2 
EdwAla 1 
ESWAIOV 1 
ESwAiwv 1 
€Coyal 1 

lev 1, 2 

ei 1 

eipnke 1 
eikool 1 

eiui 1 

eiva 1 

Blo 1; 2,3. 
e101 1 
eicomTpov 1 
Ek- 1 

&K 1 
EkaTaoTiis 1 
EKATOOTOIG 1 
ekaTooTis 1 
EAaBa 1 
edapa 1 
EAaBov 1 


éAOeiv 1 

ENOn 1 

EAGiv 1 
EAUTPOEIONG 1 
éAuTpoEloouc 1 
EuAdOTOS 1 
EUTIAGOTOG 1 
EuTTAGOTpOG 1 
év 1,2 

Eoikuia 1 

étrei 1, 2 

étri 1, 2 
ETTIKAAOULEVNG 1 
ETTIOTOANY 1 
eTTITHOiwv 1 
Epya 1 
EPUEV(E)ia 1 
épupoeidovc 1 
EPUTPOEIOF 1 
é¢ 1 

EooTITpOV 1 
éotiv 1, 2 
éogpayiouévai 1 
EOXQUE 1 

E0XO 1 

EOXW 1 

ETEpOV 1 

Exelv 1 

EXEIC 1 

EXIC 1 


cevyn 1 
¢ntAoai 1 
cHUpVN 1 
Cuyos 1 


1 
houTepov 1 
rduTepov 1 
nyEis 1 
nhyépa 1 
Huov 1 
Ngiwoav 1, 2 


OdAayoy 4 
Oeddwpoc 1 
OgooaAik6v 1 
Bem 1 

@f\Kal 1 

OnKn 1 


OnyEépa 1 


iatpeiov 1 
IATPIKOG 1 
iatpot 1, 2 
intpeiov 1 
ikool 1 

iva 1 

-10¢ 1 
iTTTrEiav 1 
i¢ 1,2, 3 


Kad’ 1 

Ka8édpa 1, 2 
KaBedpapiov 1 
KaBESpEIOV 1 
KabEdpiov 1 
KaBEdpIOG 1 
KaQedpiou 1 
KaBedpiw 1 
KaBedpiwv 1 

Kai 1, 2, 3,4, 5,6 
KaAdv 1 

KaAovo! 1 

KaAWS 1 
KGUvovTa 1 
Kduvuwv 1, 2 

Kat’ 1 

Kata 1, 2 
KaTayeivoual 1 
KaTayiyvoual 1 
KOTOKEKAIMEVOV 1 
KOTOKEKAIMEVOG 1 
KaTdporrov 1 
KaTappotrov 1 
kateBrcato 1 

KE 1 

KEKAINEvov 1 
KE@aAny 1 
KiBapidtns 1 
KAivn 1 
KAivoKadédpiov 1 
kAivTrp 1 
KOIMICETW 1 
KoIvwviv 1 
KoiTns 1 
KoAakeiac 1 
KoAAoUpia 1 
KoAAoupiwv 1 
KOTTOU 1 


KU8pa 1 
Kupiws 1 
AaAwv 1 
AauBavw 4 
A€yWUEV 1 
Aiba 1 
AiBa 1 
\ivov 1 
MoTwv 1 
AWTWVv 1 
Aitpa 1 


UGAAov 1, 2 
Mapovati 1 
Mapovatog 1 
HE 1 

Heya 1 

Heév 1 
yepidav 1 
yeTa 1, 2 
HETAOXEIV 1 
yun 1 

uiav 1 
HIKpdOv 1 
Hitou 1 

yo! 1, 2, 3,4 
Yovng 1 

you 1, 2 


vapOné 1 
vauAais 1 
vauaoic 1 
Nexoutou 1 
Nexoutw 1 

vnog 1 
voookouElov 1, 2 
VOOOKOWIoVv 1 
VuKTi 1 


Eepov 1 


6yKoG 1 
6Oev 1 
oi 1, 2 
oiac 1 
oivov 1 
Okou 1 
OKTW 1 
OKwe¢ 1 
OAKOG 1 


OUOAOY@ 1 
Ovoua 1 
6vta 1 
ofuyyeiou 1 
oguyyiov 1 
oguyyiou 1 
Opyava 1 
opyavikoic 1 
doa 1 
OouTITpOV 1 
Otav 1,2 
OTE 1 

oti 1 

ovykKia 1 
OUK 1, 2 
opPadyod 1 


TIGAElv 1 

TIGAIV 1, 2 
TravtTa 1 

Tapa 1 
Trapayeivetai 1 
Trapayeivoyual 1 
Trapayiyveta 1 
Trapayiyvouai 1 
Napadgioou 1 
Napadioou 1 
TIApPEOXEV 1 
TrapSevikdv 1 
TIAOXOVTA 1 
TaoKwv 1, 2 
TrEAEl 1 
TréWWwas 1 
Treywas 1 
TrEWWOV 1 
TIEUWW 1 
TrETTEpI 1 

Trepei 1 

Trepi 1, 2,3 
TIEPIVEIVONEVWWV 1 
TTEPIYIYVOHEVWV 1 
TINXEOG 1 
Tnpa 1 

Trnpav 1 
TINXEWS 1 
TrAEE! 1 
Trolfjoa 1 
TIOINGE 1 
TIOINGEISC 1 
TIOINTIKN 1 


TIOIOUNEVWV 1 
TTOAAOUG 1 
Tropsia 1 

Tropa 1 

Tropre 1 
TIpeoBiotTaTov 1 
TrpeoBuoTaToV 1 
Trpnvi 1 
TIPOOIAKATEXOUCA 1 
TIPOKUTITOV 1 
TIpdc 1 
TpOOMopas 1 


TITEPUYEIOV 1 
TITEpUYEIOU 1 
TTUNOIC 1 
TTWAIKOV 1 


pa 1 
-pa 1 
PGyWaTos 1 


oatrpév 1 

o€ 1 
o8uvoyxAouuETEp 1 
olvapwv 1 
OuEIloTdv 1 
OUEIAIWTOV 1 
OUIAIWTOV 1 
ouvpvn 1 

oo! 1, 2, 3, 4 
oou 1 
oTroudaodTw 1 
ooxnuatioavtes 1 
oTpattedc 1 
oTpatnyds 1 

ou 1 
OUUUETPOU 1 
ouv 1,2 
opddpa 1 
oxfya 1, 2 
oxfyati 1 
oxnuatiZeiv 1, 2 
OXNUaTICEcOw 1 
oxnuaridiv 1 
oxnuatigovta 1 
oxnuartidw 1 
OXNUATIKOv 1 
oxnuatioavtes 1, 2 
oxnudtwv 1 
oxioToU 1 
OXOAAGOTIKOG 1 
ompa 1 

TE 1,.2;-3 

Taic 1 

TautTnh 1 

TAXEIOV 1 
TAXIOV 1 

Te 1 

TH 1, 2,3 

Th 1,2, 3 

Tig 1, 2, 3,4, 5 


Ti 1 

TO 1,2 
Tolyapoiv 1 
Toig 1, 2 
TOO! 1 

Tov 1, 2,3 
TOTIOG 1 

Top 1 

ToU 1, 2,3 
Tous 1 

Tpia 1 

Tpixa 1 
TpOTIOG 1 
TUXNI 1 

TO 1 

Tov 1, 2,3,4,5 
UYpoKNAN 1 
Udati 1 
Udpeiav 1 
UdpeEiac 1 
Udpia 1 
Udpial 1 
Udpiav 1 
Udpiac 1 
Udpiokn 1 
UdpoKnAn 1 
Uowp 1, 2 
ULETEPOG 1 
uTrap 1 
UTTapXw 1 
Utredeigauev 1 
UTTép 1 
UTreppu(e)ia 1 
UTrep@ueia 1 
UTrep@uia 1 
UTmnpétai 1 
UtrTiov 1 


@apyaKo@fKai 1 
@apyaKko8nkn 1, 2 
@apyakoOnkny 1 
@dapyakov 1, 2 
@Aoldv 1 

@Aolds 1 

gAotv 1 

@Aois 1 

@opeitai 1 

@ws 1 


xaABavn 1 


* xadKav@ou 1 
* yxelpoupyeia 1 
* xElpoypageioa 1 
* xElpoypagia 1 
* xelpoupyinv 1 
* xiuéBAac 1 

* XINETAGG 1 

* xpeworeiv 1 

* xpdvoc 1 

* xpM 1,2 

* xUTpG 1 

* xwpiov 1 

* xwpic 1 


* WKTW 1 
* wvoud 1 
* WoTTEp 1 


V. XML tags 


* <ab> 1 

* <abbr> 1 

* <add> 1 

* <add place="above"> 1 

* <add place="below'> 1 

* <add place="inline"> 1 

* <add place="interlinear"> 1 
* <add place="left"> 1 

* <add place="right"> 1 

* <am> 1 

* <app> 1, 2 

* <app type="variant"> 1 

* <bibl> 1 

* <choice> 1, 2, 3 

* <corr> 1, 2,3 

© <d n="xx" v="xx"> 1 

* <del> 1 

* <del rend="corrected"> 1 
* <del rend="cross-strokes"> 1 
* <del rend="erasure"> 1 

* <del rend="slashes"> 1 

* <desc> 1, 2 

* <div type="textpart"> 1 

« <e> 1 

* <ex> 1,2 

* <expan> 1, 2 

° <f> 1 


<figDesc> 1 

<figure> 1 

<foreing xml:lang=""> 1 

<g> 1,2 

<g type="check"/> 1 

<g type="dipunct"/> 1 

<g type="horizontal-stroke"/> 1 

<g type="middot"/> 1 

<g type="mid-punct/"> 1 

<g type="monogram"> 1 

<g type="sinusoid"/> 1 

<g type="slanting-stroke"/> 1 

<g type="tripunct'/> 1 

<g type="vertical-stroke"/> 1 

<gap> 1, 2,3 

<gap reason="ellipsis"> 1, 2 

<gap reason="illegible"> 1 

<gap reason="lost"> 1, 2, 3 

<handShift> 1 

<handShift new="m2"/> 1 

<hi> 1, 2, 3 

<hi rend="acute"/> 1 

<hi rend="asper"/> 1 

<hi rend="circumflex"/> 1 

<hi rend="diaeresis"/> 1 

<hi rend="eisthesis"/> 1 

<hi rend="ekthesis"/> 1 

<hi rend="lenis"/> 1 

<hi rend="subscript"> 1 

<hi rend="superscript"> 1, 2 

<hi rend="supraline"> 1 

<hi rend="supraline-underline"> 1 

<hi rend="tall"> 1 

<I r="xx"> 1 

<Ib/> 1, 2,3 

<Ib n="xx"> 1, 2 

<lb n="xx,md"/> 1 

<Ib n="xx,minf"/> 1 

<lb n="xx,ms"/> 1 

<lb n="xx,msup"/> 1 

<lb n="xx" break="no"> 1 

<lb n="xx" rend="indent'/> 1 

<lb n="xx" rend="outdent"/> 1 

<lem> 1, 2,3 

<milestone rend="box" unit="undefined"/> 1 
<milestone rend="coronis" unit="undefined"/> 1 
<milestone rend="diple" unit="undefined"/> 1 
<milestone rend="diple-obelismene" unit="undefined"/> 1 
<milestone rend="paragraphos" unit="undefined"/> 1 


<note> 1, 2 

<num value="xx"> 1, 2 
<orig> 1, 2, 3, 4 

<p> 1 

<q> 1 

<rdg> 1, 2, 3,4 

<reg> 1, 2, 3,4 

<sic> 1,2, 3 
<space> 1 
<subst> 1 
<supplied> 1, 2 

<supplied reason="lost"> 1, 2, 3 
<supplied reason="omitted"> 1, 2 
<supplied reason="parallel"> 1 
<surplus> 1 

<title/> 1 

<unclear> 1 


Notes 


1 See already DEGANI 1992, and more recently MAGNANI 2008, 135-7; also 
M. Magnani in this volume. 

2 Cf. REGGIANI 2017, 215-7. 

3 Cf. REGGIANI 2017, 176. 

4 A multitext is basically a dynamic collection of multiple critical editions, 
a network of versions with a single root. As Monica Berti described it, “[i]t 
produces a representation and visualization of textual transmission 
completely different from print conventions, where the text that is 
reconstructed by the editor is separated from the critical apparatus that is 
printed at the bottom of the page. [... It] allows the reader to have a 
dynamic visualization of the textual tradition and to perceive the different 
channels of both the transmission and philological production of the text 
that is usually hidden in the static, concise, and necessarily selective critical 
apparatuses of standard printed editions. Producing a multitext, therefore, 
means producing multiple versions of the same text, which are the 
representation of the different steps of its transmission and reconstruction, 
from manuscript variants to philological conjectures” (BERTI forthcoming, 
4). Cf. REGGIANI 2017, 266 ff. 

5 The HMT project concept results from the statement that the Homeric 
textual evidence does not comply with the traditional philological view of 
textual variants stemming from one archetype, since a true original 
Homeric text never existed (cf. BIRD 2010): a somehow “agnostic” (BODARD 
— GARCES 2009, 96 n. 31) environment where all witnesses are transcribed 
and juxtaposed, without preference for any of them. See M. Magnani’s 
chapter in the present volume for a critical view of this idea. 

6 Ancient fragments are characterized by a high level of textual complexity, 
in the relationships among the actual text in which they are embedded, its 
critical edition (interpretation), the original source (attribution), the 
quoting source (witness), etc.: cf. BERTI forthcoming. 

7 Cf. BOLTER 1991; REGGIANI 2017, 263 ff. 

8 ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 2009, 165 

9 BAGNALL — GAGOS 2007, 74. 

10 BOLTER 1991. 

11 Cf. BODARD — GARCES 2009 

12 ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 2009, 158. An 
ontology is a formal definition of types, properties, and interrelationships 
of the entities belonging to a certain domain of knowledge. In other words, 
it compartmentalizes the variables needed for some set of computations 
and establishes the relationships between them. 

13 Cf. YOUTIE 1963, 22-3. 


14 Cf. e.g. BAGNALL 1995. 

15 Cf. HANSON 2002, 196; SCHUBERT 2009, 197. 

16 Cf. YOUTIE 1963, 27-32; HANSON 2002, passim; SCHUBERT 2009, 212-3. 
17 As I pinpointed in REGGIANI 2017, 2-6, this is the basic raison d’étre of 
Digital Papyrology. 

18 Cf. e.g. BAUMAN 2000; 2007; 2011. 

19 On the collaborative structure of the database cf. REGGIANI 2017, 232 ff. 
All editorial interventions are kept recorded in a “History” log, which is 
available to every user: see L. Berkes’ article in this same volume for a 
screenshot of a sample editorial history on Papyri.info. 

20 Cf. REGGIANI 2018a. Some remarks on the inconsistent treatment of 
iotacism can be found also in J. Stolk’s and M. Vierros’ contribution to this 
volume. 

21 Cf. DEPAUW — STOLK 2015 and J. Stolk’s chapter in this volume. 

22 A quick survey of a sample search in Papyri.info can give a global idea of 
this trend: http://papyri.info/search? 
STRING] = yetvou&targetl = TEXT&no_caps1 = on&no_marks1 = on&STRING2 = NO?’ 
+ylyvo&target2 = TEXT&no_caps2 = on&no_marks2 = on. 

23 napay{e}iwetat L mapaytyvetat in BGU XVI 2651,6; yeiveobat I. 
ylyveo8at in Chr.M. 172,i,15; katayetv[o]pfat] L kKatayiyvopat in P.Bodl. 
117,i,9; napayetvopat L rapayiyvopuat in P.Haun. II 22,5; weptyetvonevav 
I. we€plylyvonEevav in P.Stras. VIII 772 passim. Note the double possible 
regularization yiyveoOat or yevéoOat advanced for yeiveoOat in P.Col. X 
280,13. 

24 Another sample search: http://papyri.info/search? 
STRING1 = ylyvou&target1 = TEXT&no_capsl = on&no_marks1 =on. 

25 Cf. STOLK 2015b. 

26 For more similar cases cf. STOLK 2015a, 85 ff., and 2015b. 

27 Cf. DEPAUW — STOLK 2015, 213. See also STOLK 2015a, 93. 

28 On these topics cf. REGGIANI 2018a, 2018b, 2018c, and J. Stolk in this 
volume. 

29 Cf. REGGIANI 2017, 260 ff. 

30 GAGOS 2001, 516. 

31 GAGOS 2001, 514-6. 

32 GAGOS 2001, 515 n. 8. 

33 At the same time constructed artefacts, being created by people, and 
interpretable texts, they “can hold the same potential evidentiary value as 
any other kind of artifacts”. 

34 See also the observations by M. Magnani in this volume. 

35 LAME 2014 describes this idea (with reference to ancient epigraphs) 
through Foucault’s philosophical concept of dispositive: the message of the 
text-bearing object can be completely understood in relation with a 
complex network of many other heterogeneous pieces of information. The 
ultimate purpose is “to digitize also the network that connected those 


information systems, instead of digitizing each individually”. 

36 The platform Sematia, discussed by M. Vierros in this volume, is a nice 
example of how the transcription of the actual papyrus text can be aligned 
to a ‘regularized’ layer of the same text, so that any possible information is 
kept in an interactive way. 

37 DAMON 2016, 218. 

38 With DCGMP I refer to the whole digital corpus of the Greek medical 
papyri as resulted from the work of the DIGMEDTEXT project mentioned in 
the Introduction to this volume. The title Corpus dei Papiri Greci di Medicina 
Online (“Online Corpus of the Greek Medical Papyri”) refers to the first 
stages of the project. Bibliography: ANDORLINI —- REGGIANI 2012; REGGIANI 
2015; 2016a; 2017, 273-5; ANDORLINI 2017; BERTONAZZI 2018a, 24-9; 
REGGIANI 2018b; http://www. papirologia.unipr.it/CPGM. 

39 Cf. REGGIANI 2018d. 

40 Cf. ANDORLINI 1997a. 

41 Cf. ANDORLINI — REGGIANI 2012, 138-9; BAGNALL 2012, 4. 

42 See the chapter by R. Ast and H. Essler in this volume. 

43 Cf. ANDORLINI 1993; REGGIANI 2018e. 

44 Cf. NUTTON 1990. 

45 On official examinations of physicians see REGGIANI 2018f. 

46 Cf. NUTTON 1972; NIEDDU 1992, 555-7; ANDORLINI 2003, 14. 

47 On the collation of annotated copies, always according to Galen’s words 
Cin Hp. Off. I 22 = XVIIb 863,14-865,5 K.; In Hp. Epid. 1 8 = XVIla 
634,3-7 K.), cf. ANDORLINI 2003, 15, who recalls (note 15) the story of 
Mnemon, who took the third book of Hippocrates’ Epidemics from the 
library of Alexandria and brought it back with the marginal addition of 
marks indicating clinical histories, traced with dark ink and big letters, in 
imitation of the original handwriting. Cf. also BONATI 2016b, 63-4, and see 
below. 

48 Cf. ANDORLINI 2003; REGGIANI 2018e and 2018¢. 

49 Cf. REGGIANI 2018a for further details, and see below. 

50 HANSON 1997. 


51 D. LEITH, P.Oxy. LXXX 5239. 

52 ANDORLINI 1997a, 19 (cf. ibid., 21-2) 

53 BERTI forthcoming, 2. 

54 OWENS 2011. 

55 This is indeed a key issue in Digital Papyrology (cf. REGGIANI 2017, 
passim) as felt by the very first ‘fathers’ of the papyrological databases (cf. 
TOMSIN 1970, 476). 

56 I started envisaging this strategy for the medical corpus in REGGIANI 
2015, where I sketched some possible annotation layers (the article stems 
from a conference paper delivered in 2012, at the very beginnings of the 
DCGMP project). I revised my argument in REGGIANI 2016a. 

57 See the overview discussed by J. Stolk in this volume. In the following 
pages, I will be referring to the online Leiden+ guidelines at http:// 
papyri.info/docs/leiden_plus. 

58 See the query cases mentioned by M. Vierros and G. Celano in this 
volume. 

59 “This ‘intertextuality’ of the text is what G. Genette would call 
‘transtextuality’. It is not, perhaps, accidental that postmodern theories on 
language and ‘text’ developed more or less at the same time with the spread 
of the electronic media” (GAGOS 2001, 515 n. 8). Cf. GENETTE 1992. I 
outline the possible exploitation of Genette’s textual theory in relation with 
the complex textuality of Greek medical papyri in REGGIANI 2018c and 
2018e. 

60 YOUTIE 1963, 22-3. 

61 On these resources cf. REGGIANI 2017, 39 ff. (catalogues) and 14 ff. 
(bibliographies) respectively. 

62 See R. Ast and H. Essler in this volume. 

63 Cf. MARGANNE — MERTENS 1997 and the online resources cited in 
REGGIANI 2017, 34. 

64 MARGANNE 1981a; ANDORLINI 1993. 

65 See also M. Vierros’ remarks about metadata of documentary papyri in 
her article for this volume. 

66 ANDORLINI 1997a, 21. 

67 On standardization in papyrological metadata see REGGIANI 2017, 74-8. 
68 BERTONAZZI 2018a, 48-51 (see also pp. 51 ff.). 

69 In compliance with one of the original goals of the Corpus dei Papiri 
Greci di Medicina, i.e. the historical-scientific perspective described by 
ANDORLINI 1997a, 23. 

70 On translation alignment cf. e.g. VERONIS 2000. 

71 See the remarks by R. Ast and H. Essler in this volume. 

72 See the contribution by G. Celano in this volume for the problems given 
by non-breaking lines in the digital papyrus texts. 

73 An attempt of this can be found at http://litpap.info/dclp/60175 
(P.Oxy. IX 1184, Hippocratic letters). 


74 See below and the Anagnosis section of the chapter by R. Ast and H. 
Essler in this volume; cf. REGGIANI 2017, 151 ff. 

75 Cf. CORAZZA 2018a; see below for ‘milestones’. 

76 The lexical and linguistic study has always been a primary purpose of 
the Corpus dei Papiri Greci di Medicina: cf. ANDORLINI 1997a, 24. 

77 On the definition of linguistic corpus cf. SINCLAIR 1996; in general on 
corpus linguistics cf. LUDELING — KYTO 2008-09; LUDELING 2011; and see M. 
Vierros and I. Bonati in this volume. On the theoretical and practical 
correctness of treating Greek medical papyri as a proper textual corpus I 
think there must be no doubt. A linguistic corpus is usually intended as a 
selection of sample texts representative enough of a language, and though 
the medical papyri at our disposal come from a random and incomplete 
selection, they can be considered as the entire reference population rather 
than as a sample of a larger group, so that linguistic annotation seems to 
me absolutely feasible. 

78 Cf. REGGIANI 2017, 178 ff., and M. Vierros in this volume. 

79 Cf. REGGIANI 2015 and 2016a; BERTONAZZI 2018b. 

80 Cf. MARAVELA — REGGIANI 2018. ROUED-CUNLIFFE 2014 described how 
digital encoding can prove useful for the analysis of grammar patterns of an 
ancient textual corpus (the Vindolanda tablets). A seminal project on 
annotating a corpus of private letters on papyrus, conducted by S. Porter 
and M. O’Donnell, has produced a number of valuable observations about 
modes and tenors of discourse, structures of information, semantic patterns, 
and so on (PORTER — O’DONNELL 2010). 

81 “[L]la possibilita di identificare alcuni papiri con trattazioni di un autore 
tramandato solo indirettamente inserisce tasselli nuovi nella complessa 
stratificazione della trasmissione indiretta, soprattutto quando sono i papiri 
i soli testimoni diretti di autori tramandatici per excerpta e citazioni 
(Apollonius Mys, Heras, Heliodorus, Herodotus Medicus)” (ANDORLINI 
1997a, 22). 

82 ANDORLINI 2006 pinpointed the existence of an expressive strategy of 
medical technical texts: “L’osservazione di tali fenomeni, e del loro 
riproporsi costantemente nella tradizione dei testi medici greci su papiro, 
permette di riconoscere diverse fasi e livelli in cui il sapere tecnico 
contenuto nella ricetta medica veniva materialmente veicolato al lettore/ 
consumatore attraverso moduli espresssivi e dispositivi tecnici, visivi, fisici, 
che formano una sorta di koiné, un tutt’uno tra lingua tecnica e scrittura 
speciale dei testi. Di qui la suggestione di rintracciare una specie di ‘gergo’ 
nei connotati di quel particolare linguaggio criptico, grafico ed espressivo, 
che comunica all’interno di una determinata categoria professionale: il 
medico, gli altri medici (i colleghi), il farmacista, il commerciante di 
farmaci, il paziente. Si tratta di modi speciali di usare parole e segni 
attraverso i quali le competenze medico-terapeutiche tendono a 
specializzarsi all’interno di una corporazione di addetti alla professione 


medica” (p. 153). 

83 “L’analisi sintattica attraverso ’annotazione in un cosiddetto ‘treebank’ 
potrebbe mostrare pit’ chiaramente la struttura del testo e facilitare il 
confronto tra il testo veicolato dal papiro e la tradizione manoscritta, 
soprattutto nel caso di papiri per cui si sospetti una possibile paternita” 
(BERTONAZZI 2018a, 74). The case of surgical author Heliodorus is 
paradigmatic: “l’analisi del lessico tecnico dei papiri chirurgici ha portato a 
individuare paralleli testuali tra testo tramandato su papiro e tradizione 
manoscritta, talvolta significativamente stringenti come nel caso di 
P.Strasb. inv. 1187 e diversi passi di Eliodoro ap. Oribasio. Alcuni altri 
papiri (P.Lond.Lit. 166, P.Gen. inv. 111, P.Fuad.Univ. 1, P.Ryl. 3.529), 
come gia notato dagli studiosi, sono caratterizzati da una forte presenza di 
‘lessico eliodoreo’ e da alcune peculiarita proprie del modus operandi del 
chirurgo, come la predilezione di interventi chirurgici che siano il pit sicuri 
possibili per il paziente, nonché del modus scribendi, come il ricorso 
frequente alla prima persona — singolare o plurale -, la definizione con 
esattezza delle posizioni ‘topografiche’ della parte operata (dentro, fuori, 
sopra, sotto), e una sostanziale semplicita delle strutture sintattiche usate. 
Ad oggi, i tentativi di attribuire i papiri citati alla paternita di Eliodoro si 
sono basati quasi esclusivamente su criteri lessicali nel confronto tra il testo 
tramandato su papiro e sui capitoli di Oribasio che portano la titolatura ‘da 
Eliodoro’. Una nuova possibile strada offerta dalle nuove tecnologie della 
papirologia digitale é quella costituita dall’annotazione sintattica dei testi: 
un’analisi pit. accurata non solo del lessico, che come é noto é la parte pit 
‘volatile’ della lingua, ma delle strutture morfologiche e sintattiche dei 
passi del compilatore tardo in sinossi con i testi dei papiri, sia pure nella 
limitatezza delle pericopi testuali preservate, potrebbe gettare nuova luce 
anche su questo aspetto tra i piti incerti quanto stimolanti della ricerca” 
(BERTONAZZI 2018a, 242-3). Marja Vierros has recently presented at the 
workshop “Act of the Scribe: Interfaces Between Scribal Work and 
Language Use” (Athens, April 6-8, 2017) some preliminary remarks on 
Applying Modern Authorship Attribution Methods to Papyri and Ostraca 
(abstract at http://blogs.helsinki.fi/actofscribe/workshop): cf. REGGIANI 
2017, 185. 

84 Cf. REGGIANI 2017, 181; R. Ast and H. Essler in this volume. 

85 Cf. RIANO RUFILANCHAS 2014, 160-1; ESSLER — RIANO RUFILANCHAS 2016, 
498; and the observations by R. Ast and H. Essler, M. Vierros, and G. 
Celano in the present volume. 

86 Cf. ANDORLINI 1997a, 24. For more recent works on this topic see 
BONATI 2016a, 2017, 2018a, 2018c, and BERTONAZZI 2018a. For a parallel 
exploitation of digital encoding for the development of vocabulary analysis, 
cf. ROUED-CUNLIFFE 2014 apropos of the Vindolanda corpus. 

87 Cf. also BONATI 2018b and 2018d. On the connection between digital 
editions and Medicalia Online cf. also BERTONAZZI 2018a, 43-8 and 73-4, 


and 2018b. 

88 On “the interdependence of lexica and new editions” cf. ESSLER — RIANO 
RUFILANCHAS 2016, 492. ROUED-CUNLIFFE 2014 speaks of “integrated 
indexing”. 

89 Cf. e.g. the case of the Antinoupolis papyri described in CORAZZA 2018b. 
90 Cf. ANDORLINI 2006. 

91 Cf. REGGIANI 2018b, and see L. Berkes in this volume. As he notes, text- 
image alignment could be a good compromise: but searching for the 
different abbreviation types would not be possible as well. 

92 BERTONAZZI 2018b; cf. BERTONAZZI 201 8a, 67. 

93 In general, on abbreviations in papyri see e.g. CLARYSSE 1990, DEGNI 
1999, and GONIS 2009; with special regards to documentary texts, BELL 
1953 and BLANCHARD 1974; for literary papyri, MCNAMEE 1981 and 1985. 
A typological work on the abbreviations in medical papyri has been 
preliminarily conducted by L. Iori and M. Centenari in the framework of 
the Corpus of the Greek Medical Papyri Online project (cf. http:// 
www.papirologia.unipr.it/eventi/GDS/2010/centenari-iori.html). 

94 Cf. http://www.stoa.org/epidoc/gl/latest/trans-abbrevmark.html. 

95 On the relevance of the monogram yp(@) see ANDORLINI 2018. 

96 Cf. http://papyri.info/docs/leiden_plus s.v. “Non-alphabetical character 
with symbol”. 

97 http://www.stoa.org/epidoc/gl/latest/trans-regularization.html: “It is 
most common to mark a regularization of this kind at the level of the whole 
word, rather than of individual characters affected [...]. This will make it 
easier to generate an apparatus reading for the regularized form (or the 
original form, depending on which you want to privilege), but it may also 
be impossible to identify individual affected characters in a dialect spelling 
or grammatical form. On the other hand, tagging the individual characters 
might make it easier to index or search for specific features, such as the 
iotacism of t and et”. See J. Stolk’s observation (in this volume) that 
apparatus regularizations/corrections work as textual equivalents and not 
‘better’ substitutes of the original text. 

98 For some case studies cf. BERTONAZZI 2018a, 63-4. 

99 In the earlier Duke Databank of Documentary Papyri, “the conventional 
koine form is given first, followed by numbered braces enclosing the 
scribe’s form or the edition’s misprint: e.g., dvoua {4mvoua}4 shows that 
the scribe has misspelled 6voua, Uxeép {Susap}5 that he wrote epsilon over 
alpha, avtot {6avTw}6 that he miswrote dative for genitive, AOUp 
{7A00c}7 that the edition has a misprint for AOUp” (WILLIS 1984, 169-70); 
cf. REGGIANI 2017, 216. This slight prominence given to the ‘standard’ form 
was retained in the first stages of Papyri.info, where it was included in the 
main digital text, whereas the original form as written by the scribe - 
marked with |reg|, |corr| or the former |orth| tag (cf. REGGIANI 2017, 236 
n. 119) — was displayed in the apparatus. As of September 2011, the two 


elements in the |reg| tag have been swapped with each other (cf. http:// 
digitalpapyrology.blog-spot.it/2011/09/just-posted-to-papylist-dear- 
colleagues.html). This required a huge effort, because the ancient reading 
was originally transcribed diplomatically without spirits and accents, but 
its inclusion in the text made it necessary to add them (cf. REGGIANI 2017, 
224). 

100 “Nei papiri é scritto quasi regolarmente Cu-” (ANDORLINI 1981, 61 n. 
54), which conversely should be a ‘deviating’ spelling of ‘regular’ ouvpvn 
(cf. GIGNAC 1976, 121-2). 


101 Cf. BONATI 2015. 

102 On this peculiar double fluctuation cf. BONATI — REGGIANI 2018. 

103 Cf. e.g. BOSCHETTI 2007 apropos of philological variant alignment; 
further discussion in REGGIANI 2018a. The current platform also allows for 
handling language shifts, i.e. the markup of a language or script different 
than the main document’s default. This is rendered with the tag < foreign > 
and the xml:lang attribute, the value of which may be grc for Greek words 
in a Latin text, la for Latin words in a Greek text, grc-Latn for Greek words 
in Latin characters, la-Grek for Latin words in Greek characters, and so on. 
In Leiden+ it is marked ~|x|~gre and the like. If characters or lines in a 
different language or script are omitted by the editor, this is indicated with 
the <gap reason="ellipsis"> tag (see above) including a <desc> tag 
filled with the appropriate language (e.g. Coptic, Demotic; Leiden+: (Lang: 
Coptic 1 line) etc.). It is also possible to mark up cross-language 
equivalencies, for example giving the Greek correspondent of a Coptic 
term. For this task, the current system exploits the ‘regularization’ tag by 
adding an xml:lang attribute, e.g. <choice> <reg xml:lang="grc"> 


GpaKxog </reg> <orig> APAK </orig> </choice> (Leiden+: 
< :Gipaxos = gre|reg| Db PXK:>). The explanatory note goes into the 
apparatus accordingly. 

104 Cf. REGGIANI 2018a and 2018c. 

105 A comparable case is totic (and the following forms supplied 
accordingly) in P.Fay. 204,9 (http://litpap.info/dclp/60181) vs Ionic Totot 
of the rest of the tradition of Hippocratic aphorisms. 

106 MARAVELA — LEITH 2007. The papyrus will be republished in the 
forthcoming third volume of the Papyri Osloenses. I am most grateful to 
Anastasia Maravela for sharing her drafts of the new edition and for 
discussing with me some textual and linguistic details. 

107 Orib. Syn.Fust. Ill 28, 6 and 9 = CMG VI 3, p. 75, 15-16 and 21 
Raeder; Steph. In Hp. Progn. Il 1 = CMG XI 1,2, p. 140,25 Duffy. The case 
resembles — mutatis mutandis — that of dountpov, the above-mentioned 
‘deviant’ form of €(Yoontpov “mirror”, for which BONATI 2016a, 246-8 
already proposed the rank of substandard. 

108 Cf. GIGNAC 1976, 105. 

109 The previous editors corrected it in ¢pvOpoEetdovc, but the newest 
Belles Lettres edition (PETIT 2009) prints €AuTpoEls00c (XII 11, p. 40,1; see 
PETIT 2009, xcvi-xcix for the description of the manuscript tradition). Quite 
interestingly, the author of the treatise came possibly from Egypt (cf. PETIT 
2009, 1—li), which suggests that the phonetic variation could have worked 
both ways. I discuss this and the preceding case in REGGIANI 2018a and 
2018e. 

110 Cf. T.T. RENNER, ad loc. 


111 The same phonetic exchange B/@ occurs elsewhere in the papyrus: see 
duio~atva for dupiofatva in ii,9, 14, 15. 

112 Cf. ANDORLINI 2006, 163, and 2018; REGGIANI 2018a. 

113 GENETTE 1992, 83, then GENETTE 1997, 1. 

114 Cf. GENETTE 1992, 83-4, later developed in GENETTE 1997, 1-7. 

115 On punctuation in the papyri cf. the overviews by TURNER 1987, 7-10, 
and CRIBIORE 1996, 81-3. For specific issues cf. DEL MASTRO 2017 
(Herculaneum papyri) and FUNARI 2017 (historical fragments). For the 
particular care for paratext in the digital editions of literary papyri see the 
notes by R. Ast and H. Essler in this volume. 

116 On ancient punctuation and encoding/annotating issues see the article 
by G. Celano in this volume. On filling marks in the papyri cf. BARBIS LUPI 
1992. On diacritical and lectional signs see now also NODAR DOMiNGUEZ 
2017 and MCNAMEE 2017. 

117 Cf. http://www.stoa.org/epidoc/gl/latest/trans-textpart.html for the 
difference between Divs (structural text parts) and milestones (non- 
structural text parts). A possible issue is that in fact e.g. paragraphoi may 
actually mark a subdivision between structural text parts, but the 
underlying rationale in the papyrological markup seems to be that they 
represent graphical separators or turning points marked by the ancient 
scribe. 

118 On these peculiar signs cf. BARBIS LUPI 1994 (paragraphos), BARBIS LUPI 
1988 (diple obelismene), SCHIRONI 2010, 16-18 and passim (koronis). 

119 A <hi> tag would probably be better: see below the case of eisthesis/ 
ekthesis. 

120 Diacritics and lectional signs added by different hands are another case 
of uneasy elaboration. 

121 TURNER 1987, 8; cf. also CRIBIORE 1996, 83 (“Blank spaces can be used 
as punctuation”). 

122 Vacat can be used also to render columnation in particular layouts 
(lists, accounts, etc.), but the use is not standardized. See the chapter by L. 
Berkes in this volume for some remarks on the markup of layout in 
documentary papyri. Very recently, DICKEY 2017 has dealt with particular 
layouts of bilingual texts, where the columns are handled with blank 
spaces. 

123 This case is currently encoded as an editorial apparatus note displaying 
the omitted text. 

124 Cf. REGGIANI 2016b with earlier bibliography; also BONATI 2018e. 

125 Cf. ANDORLINI 1999; REGGIANI 2018h. 

126 Cf. ANDORLINI 1997b, 159, and see above. 

127 Cf. BARNS 1949, 4—5. Online: http://litpap.info/dclp/65633. 

128 Cf. HANSON — MATTERN 2001, 72 and MAGDELAINE 2004, 63, 
respectively. Online: http://litpap.info/dclp/69007 and http://litpap.info/ 
dclp/69028. 


129 On the ecdotic relevance of line displacement in the system of the 
margins of the Greek literary papyri see SAVIGNAGO 2008 (cf. also TURNER 
1987, 8). 

130 In medical papyri ekthesis is somehow less frequent than eisthesis; a 
significant case is presented by CORAZZA 2018a. See also P.Oxy. XIX 2221r 
+ P.K6oln V 206r (http://litpap.info/dclp/61916), the aforementioned 
commentary to Nicander’s Theriaka, where the lemmas containing the 
commented passages are highlighted by ekthesis (cf. ANDORLINI 2000, 39) 
and the comments are introduced by larger blank spaces, which might be 
considered as eistheseis. 

131 Cf. http://www.stoa.org/epidoc/gl/latest/trans- 
charactershighlighted.html. 

132 Cf. REGGIANI 2018h, where I advanced a further distinction of eistheseis 
according to their appearance in the texts. It is worth noting that encoding 
eisthesis/ekthesis as a <hi> element would prove helpful also when 
handling whole indented/outdented paragraphs (see the sample in the 
Conclusions below), instead of marking each line. 

133 On this typology of signs cf. TURNER 1987, 10-12; CRIBIORE 1996, 83- 
8; COLOMO 2017; AsT 2017. 

134 P.Tebt. II 679 + P.Tebt.Tait 39-41 (II AD): http://litpap.info/ 
delp/63596; P.Johnson + P.Ant. III 214 (IV-V AD): http://litpap.info/ 
dcelp/64598 (cf. REGGIANI 2018i). 

135 Cf. WORTON — STILL 1990; POLACCO 1998; BERNARDELLI 2000; 
BERNARDELLI 2010, esp. 9-62. 

136 FINGLASS 2015. 

137 On the concept of intertextuality applied to ancient quotations cf. 
BERTI 2012, part. 439-46, about ancient historians. 

138 Cf. HANSON 1997,296; ANDORLINI 1997a, 19-20. 

139 So far, this has been possible only in the line-by-line commentary: cf. 
BERTONAZZI 2018b and CORAZZA 2018a for discussion and case studies. 

140 Cf. ANDORLINI 1997a, 22 n. 15 and 23. 

141 E.g. ii,12 mA€et GyKoc vs méAEL OyKOG w & Gow-Scholfield, wéA€t 
OAKOC I & O. Schneider (Nic. Ther. 387). According to COLONNA 1954, the 
original text could have been mA€el OAKOG, subsequently popularized in 
mwEAEL OAKOG and glossed with 6yKoc. 

142 Full analysis in COLONNA 1954. 

143 Cf. ANDORLINI 2001. For another Hippocratic papyrus preserving an 
interesting and complex textual history (P.Ant. III 184, VI AD, http:// 
litpap.info/dclp/60192) cf. HANSON 1970; in particular, "the sequences of 
the Hippocratic texts do not correspond to the one established in medieval 
tradition but seem to follow autonomous criteria" (CORAZZA 2018b, 174). 
144 While digital repositories of literary texts do exist, they usually do not 
record all manuscript variants of the texts (see above); they could well be 
connected to the papyrus texts but information would be partial. A possible 


solution may be to create multitextual digital editions of the literary texts. 
It is important, of course, to distinguish parallel passages in copies of the 
same text from quotations embedded in different texts; for the latter, the 
current platform offers the possibility to deploy the <q> tag marking 
quoted phrases (Leiden+: quotation marks + space), which can be easily 
used to differentiate the appropriate cases. 

145 Traditionally, documentary papyri preserving the ‘same’ text in 
multiple copies (for a catalogue of duplicates cf. NIELSEN 2000) are treated 
in the ‘philological’ way, i.e. collated and merged in one source archetype: 
e.g. http://papyri.info/ddbdp/p.tebt;3.1;771dupl (note the suffix ‘dupl’ 
added to the URL of the digital text, which advises about the existence of a 
duplicate of the papyrus). However, a certain degree of uneasiness is felt 
about such a practice , see e.g. in Jelle Stoop’s words: “I disagree with this 
editorial choice for two reasons. First, in a field like papyrology, every copy 
of a text deserves full consideration and [...] an archetype that would 
somehow be considered more authentic than a later copy is an editorial 
fancy. Copies of the same text, however similar, were written with a 
purpose in mind, so that edition should be more rather than less 
interesting. Second, in order to appreciate the fact that we have multiple 
copies [...], we must ask why different versions of it exist in the first place. 
The interest of these documents is, therefore, not restricted to the text 
alone, but extends to the life and afterlife of its copies in relation to one 
another. In sum, the text of just one fragment does not make for a 
satisfactory edition of understanding of this [text]. By editing the texts in 
their own right, we learn about the convention of [...] writing in [Graeco- 
Roman] Egypt” (SToop 2014, 185). A new ‘phenomenological’ 
consideration of papyrus copies is emerging (cf. YUEN-COLLINGRIDGE — 
CHOAT 2012, with interesting preliminary comments on textual differences 
between copies of the same document), but, for now, the digital database is 
following the ‘philological’ practice, with a significant loss of information. 
Giuditta Mirizio (Bologna) is currently working on this topic also from the 
perspective of digital encoding and XML annotation. On this topic, see also 
below. 

146 The proposed tag (<app type="variant'">, Leiden+ |var|) raised 
some theoretical and methodological issues, for example whether to choose 
just one manuscript variant or to encode all possible instances. Moreover, 
the <app> tag (see below) typically envisages a “lemma” (<lem>) part, 
which corresponds to the word(s) in the text, and one or more “reading” 
part(s) (<rdg>), corresponding to the alternative(s) in the apparatus, and 
it must be clearly thought how this should work in the case of philological 
variants. Currently, the Digital Corpus of the Greek Medical Papyri has 
adopted the solution to just describe the most relevant manuscript variants 
in the line-by-line commentary. 

147 On translations in the tradition of ancient Greek medical texts see e.g. 


GAROFALO — FORTUNA — LAMI — ROSELLI 2010. 

148 BERTONAZZI 2018a, 242-3. 

149 Cf. e.g. ANDORLINI 2014. “La possibilita di identificare alcuni papiri 
con trattazioni di un autore tramandato solo indirettamente inserisce 
tasselli nuovi nella complessa stratificazione della trasmissione indiretta, 
soprattutto quando sono i papiri i soli testimoni diretti di autori 
tramandatici per excerpta e citazioni (Apollonius Mys, Heras, Heliodorus, 
Herodotus Medicus)” (ANDORLINI 1997a, 22). 

150 On which cf. KOLLESCH 1963 and FISCHER 1998 respectively. For 
general considerations about catechisms on papyrus see also BONATI 2018e 
and BERTONAZZI 2018a, 57-62 , as well as REGGIANI 2016b. 


151 LEITH 2007; cf. already ANDORLINI 1997b, 160. 

152 Cf. MARGANNE 1986 and now BERTONAZZI 2018a, 237-8. 

153 Cf. BERTONAZZI 2018a, 236-7, with earlier bibliography. 

154 REGGIANI 2018 and 2018}. 

155 Cf. TOTELIN 2009a, part. chapters 1-3. 

156 Comp.med.loc. 1 1 = XII 423,13-15 K. (a recipe is found in a dead 
physician’s parchment notebook and then forwarded to Galen); Antid. 15 = 
XIV 31,10-15 K. (exchange of recipes); Indol. 33-5 (his own personal 
collection of worldwide prescriptions, destroyed by the AD 191 fire). See 
also P.Mert. I 12,13-24, attesting to the very same activity of exchange 
between two colleague physicians in Egypt. 

157 The ancient practice of collating several copies (antigrapha) of medical 
texts is attested above all by Galen, who noted several degrees of 
manuscript divergences, ranging from small linguistic variations to major 
discrepancies in the content, e.g. in the ingredients and quantities (cf. 
ANDORLINI 2000, 38-9; ANDORLINI 2003, 14-15; TOTELIN 2009b; BONATI 
2016b, 64-5), but we know of other cases in which the ancient readers 
produced ‘personal’ copies that became, by means of reformulations and 
abbreviations, new recensions of the same text (cf. ANDORLINI 2000, 37-8). 
In some cases it is possible to speak of erroneous or inaccurate deviations 
from the original (it is the case with Galen’s treatises, for which the ancient 
author himself stigmatised the circulation of incorrect versions of his own 
books: De libris propriis I 91-3 Miller = XIX 8-11 K.; cf. HANSON 1985, 
43-5) but in other cases it is difficult to go back to a genuine text (HANSON 
1985, 34-5 makes the example of Hippocratic letters). In general, on 
Galen’s ‘philological’ work cf. HANSON 1998; ANDORLINI 2003, 15-16; 
DORANDI 2014; BONATI 2016b, 63-5. 

158 In fact there are some interesting variants, which as usual show how 
papyri can contribute to the history of the texts: in particular, at line 10 
(kaXovot pap. : KaxXovot Kai Gal.) the papyrus offers a superior reading, 
since the conjunction is syntactically unfit; further discussion in CORAZZA 
2016 ad locc. On the value of the variants attested in the papyri see above. 
159 Cf. MARGANNE 1980, 182-3. 

160 In particular, the first prescription of the second column (Il. 1-3) 
parallels Gal. Comp.med.loc. XII 593,14 K. verbatim, while the following 
two (ll. 4-8 and 9-15) show partial overlaps with ibid. XII 594,1-4 (= Aet. 
VI 50,75-9) and XII 594,7 ff. K. All these recipes are ascribed to Heras. The 
remaining traces of fifteen lines, articulated in four more recipes, could not 
be identified with any known text. 

161 BONATI 2016b, 66. 

162 The Leiden+ tag for parallel passages is meant to mark omitted text 
that is supplied on the ground of parallels, so it does fall into a different 
typology (see below, editorial interventions). 

163 On the practice of annotating medical treatises with scholia and 


comments cf. ANDORLINI 2003; in general, on scholia and commentaries in 
the papyri cf. MESSERI SAVORELLI — PINTAUDI 2002.. 

164 Cf. MANETTI — ROSELLI 1994. 

165 cf. ANDORLINI 2000, 41-2; ANDORLINI 2003, 19-24 

166 Cf. CORAZZA 2018a. 

167 The two cases are tightly related, and can even merge together in the 
so-called “commented editions” discussed by VANNINI 2015. 

168 See the observations by CORAZZA 2018a. On the interactions between 
text and glosses, very interesting is the analysis by MANIACI 2002, though 
referred to later types of texts. 

169 E.g. <lb n="1,minf"'/> for lines in the bottom margin (the other 
margins are indicated with msup, ms, md). In Leiden+ this information is 
added to the line number accordingly. 

170 BERTONAZZI 2018a, 53-4; cf. also ibid., 73 for its digitization; CORAZZA 
2018b, 46-57; on the annotations, MCNAMEE 2007, 463 ff. 

171 For the cases of scribal additions, Leiden+ recovers some traditional 
Leiden conventions, so that supralinear insertions are encoded between two 
slashes (\x/) and infralinear insertions with reversed slashes (//x\\). The 
other types of additions are rendered as ||left:x||, ||right:x||, ||interlin:x||. 
172 Cf. BRODERSEN 1994, 103-7. 

173 The papyrus presents also interesting cases of intertextuality (see 
above): of Ep. 5, it transmits the shorter form with certain variations, while 
Ep. 6a, a letter to Gorgias previously unattested, has striking coincidences 
of phraseology with ‘canonical’ Ep. 6, addressed to Demetrius. 

174 Sometimes a marginal annotation can be swallowed up in the main 
text, generating a textual issue that can be likely explained only by means 
of metatextual correlations: this is the case with P.Gen. inv. 111 (http:// 
litpap.info/dclp/63819), where the reading Pdupatos Ff pi|[t]ov (Il. 15- 
16), presenting two technical terms that are almost synonyms, may stem 
from a gloss (cf. BERTONAZZI 2018a, 241-2). 

175 It is worth noting that similar graphical devices are used by the author 
of the Anonymus Londinensis to frame alternative versions of the same 
passage (cf. CRIBIORE 2018 and see below). 

176 I thank very much Todd M. Hickey and Derin McLeod for the help in 
getting a high-resolution picture of the fragment. I mention this case in 
REGGIANI 2018a, 2018b, 2018c, with discussion of the tentative code used 
to digitize it. 

177 Writing a word between dots could be a way to highlight later 
corrections, like e.g. the koppa in P.Eirene III 25, 3 (III AD; see comment ad 
loc.). 

178 MARGANNE 1981b, 76. 

179 TOMASI — ZAJA 2002 discuss some interesting solutions for the 
encoding of marginal writings, though dealing with quite later types of 
texts. 


180 Leiden+ employs the double square brackets as in conventional 
printed editions; only square brackets for plain erasures, [/ ] for slashes 
and [X ] for crosses. 

181 Leiden+ employs a |subst| tag working the same wasy as |reg| and | 
corr|. An interestingly complex case (P.Strasb. inv. 1187, A, i,11 = http:// 
litpap.info/dclp/59968) is presented by BERTONAZZI 2018a, 64 and 69-70: 
an ancient scribal correction, involving the insertion of a letter supra 
lineam, was read differently by two editors, so that they proposed two 
different interpretations, one of which involved a regularization. The newer 
reading is OMElALOTMV corrected in OUELAL@/TOV i.e. OLLALWTOV; the 
previous reading was y@ det ALOTwV corrected in y@ det ALo/Twv. In 
Leiden+ this is encoded as <:<:<:opltAlotTdvlreg|opEelAt\o/TOv: > | 
subst|QWElALoTOv: >led|yam Sel <:At\o/tTwv|subst|ALotwV: > =ed.pr.:>, 
and the XML is cross-nested accordingly, so that the current display on the 
platform appears quite messed up. 

182 On medical autograph papyri cf. ANDORLINI 1997a, 22 with earlier 
bibliography; MARGANNE 2004, 90-1. 

183 Cf. —http://www.tei-c.org/Activities/Council/Working/tew19. html]; 
PIERAZZO 2008. 

184 CRIBIORE 2018: see in particular the case of the medical Anonymus 
Londinensis and the related discussions of double versions. From the 
computational viewpoint, cf. MACE — BARET — BOZZI — CIGNONI 2006 (in 
particular, PASSAROTTI 2006). Genetic criticism can be applied to some 
documentary categories which show a certain complexity of textual 
composition. One may recall, just for instance, the legal documents of 
Ammon’s archive, produced in multiple versions (P.Ammon II; cf. 
CRIBIORE 2018); Raffaele Luiselli’s considerations about authorial 
revisions in Roman letters and petitions (LUISELLI 2010); the mostly 
neglected cases of duplicates recently ‘rediscovered’ by Malcolm Choat and 
Rachel Yuen-Collingridge (YUEN-COLLINGRIDGE - CHOAT 2012); the 
composing process of administrative reports studied in the Project Synopsis 
at the Heidelberg University especially by Uri Yiftach (cf. REGGIANI 2016c); 
the very recent discussion on drafts and copies by Andrea Jordens (JORDENS 
2017). 

185 See, however, DAMON 2016 for criticism of this way of handling 
apparatus readings by TEL 

186 Cf. CORAZZA 2018a; BERTONAZZI 2018a, 64—7 (with case studies) and 
2018b. 

187 The theoretical assumption that the fragmentary status of the papyri 
may be thought as a (paradoxical) sort of ‘non-voluntary quotation’, 
selected by the chance and by the material circumstances rather than by 
the will of an author, would allow to envision a transtextual link between a 
‘virtual’ hypertext (the original document, lost, more or less recoverable in 
a philological way) and the concrete hypotext (the actual fragment; cf. 


REGGIANI 2016a; see also ROMANELLO — BERTI — BOSCHETTI — BABEU — CRANE 
2009, 160 and 162: “[...] fragments do not actually exist outside of 
scholars’ interpretations. [...] Fragments are always _ scholarly 
reconstructions and interpretations of the content and structure of lost 
works”). This may allow for creating several multiple layers for editorial 
alternatives and supplements too, thus avoiding complicated nested tags as 
in the case of multiple alternatives in a series of different supplements or 
modern editorial readings (see the samples provided in the Conclusions 
below). 

188 For two cases in the medical papyri, see BERTONAZZI 2018a, 67-8 
({THvV otvapv} in P.Strasb. inv. 1187, A, i,14 and {o}oxynyatioavtes in 
P.Lond.Lit. 166, iv,6). 

189 REGGIANI 2017, 137 ff. 

190 See above and the Anagnosis section of R. Ast’s and H. Essler’s chapter 
in this volume. 

191 YOUTIE 1996. For the following observations, I refer to HANSON 1996, 
HANSON 1997, 302-4, and ANDORLINI 2003, 26-8. 

192 Cf. YOUTIE 1996, 1-3; ANDORLINI 2003, 26-7. 

193 HANSON 2010, 197-8, and 1997, 303. 

194 HANSON 2010, 197-8, and 1997, 303. 

195 HANSON 2010, 199; cf. also the observations by BONATI 2016b, 60-9. 
196 Cf. ANDORLINI 2003, 26-7. 

197 The DCLP digital edition of the Michigan Medical Codex has been 
encoded by students of the Papyrology class (F. Bertonazzi, F. Corazza, L. 
Rizzardi, M.E. Galaverna, C. Bottioni, M. Catania, F. Giraldi, P. Lillo, G. 
Saccani, E. Mazzetti, L. Mazzolari, A. Brunazzi, E. Angolani, N. Pajares 
Collado, C.M. Ferrari) under the supervision of L. Iori, M. Centenari, I. 
Bonati, and N. Reggiani. 

198 L. Berkes, in his chapter for this volume, asks: “should we expect 
online editions to conform to the norms of traditional printed editions or 
should we accept them as a slightly different form of publication?” 

199 BABEU 2011, 36. 

200 BODARD — GARCES 2009, 96. 


201 BABEU 2011, 36. 

202 NICHOLS 2009. 

203 BABEU 2011, 39. 

204 CAYLESS 2010, 148. Quite interestingly, Cayless’ picture exactly 
parallels the arguments brought by HANSON 1997 apropos of the 
transmission of ancient medical fragmentary texts, and the “accretive 
model of composition” (e.g. p. 305, see above) that she envisages to 
overcome the limits and rigidities of stemmatological interpretations. 

205 Cf. also CAYLESS 2010, 170. 

206 Principal investigators on the project were Rodney Ast (DFG) at the 
University of Heidelberg and Roger Bagnall (NEH) at New York University. 
Both Ast and Holger Essler have presented the project on numerous 
occasions, and a description of the initiative can be found also in REGGIANI 
2017, 250-4. 

207 The former, which is largely confined to Greek literature, is hosted at 
http://stephanus.tlg.uci.edu;the latter, which comprises both Greek and 
Latin, can be found at http://www.perseus.tufts.edu/hopper. Colleagues at 
Tufts University and the University of Leipzig are working in a much more 
ambitious project called the Open Greek and Latin Project (OGL) to expand 
Perseus in order to include all Greek and Latin texts, http://www.dh.uni- 
leipzig.de/wo/projects/open-greek-and-latin-project.DCLP has the potential 
both to enhance OGL and benefit from it. 

208 The project is very much in tune with current efforts in classical 
studies to account for physical aspects of ancient witnesses. One example of 
these efforts is the University of Heidelberg’s Sonderforschungsbereich 933, 
Materiale Textkulturen; more information about this SFB can be found at 
http://www.materiale-textkulturen.de. 

209 See _ http://litpap.info/dclp/62823, which contains extensive 
bibliography. 

210 The ostrakon was first published by VIERECK 1925, 254-5, and 
subsequently in PORDOMINGO 2013, no. 27. Wolfgang Luppe arrived 
independently at the same conclusion about Viereck’s reading of mQAtv in 
his re-edition of the ostrakon in CPF II, 2, pp. 358-9. 

211 See http://www. litpap.info/dclp/64529. 

212 (12 November 2017). 

213 A list of Herculaneum texts entered is available at: http://epikur- 
wuerzburg.de/thv. 

214 GIGANTE 1979. 

215 Examples are TM 62400, Philodemus De pietate, and TM 62419, 
Philodemus De poematis 1 (cf. e.g. the scheme in OBBINK 1996, 43; JANKO 
2003, 105). 

216 See RIANO RUFILANCHAS 2014. 

217 Cf. Martin Jenckel, Syed Saqib Bukhari, Andreas Dengel: “anyOCR: A 
Sequence Learning Based OCR System for Unlabeled Historical 


Documents,” 23nd International Conference on Pattern Recognition 
(Mexico 2016). 

218 http://papyri.info. For a detailed discussion see REGGIANI 2017, 222 ff. 

219 Born-digital editions were already included in the original concept of 
Papyri.info. 

220 DDbDP 2013 1: “Account of Barley and Wheat”: http://papyri.info/ 
ddbdp/ddbdp;2013;1. 

221 DDbDP 2015 1: R. AST -L. BERKES, “A Letter from Athanasios the 
Scholastikos Mentioning Constantinople” (http://papyri.info/ddbdp/ 
ddbdp;2015;1); DDbDP 2015 2: A BERNINI, “A Requisition of Workmen and 
Donkeys” (http://papyri.info/ddbdp/ddbdp;2015;2); DDbDP 2015 3: L. 
BERKES, “A Wife in Prison: A Letter from 7th Century Fayum” (http:// 
papyri.info/ddbdp/ddbdp;2015;3). 

222 AST — BERKES — COWEY 2016. Cf. REGGIANI 2017, 241. 

223 These were announced (together with DDbDP 2013 1) in AST — BERKES 
— COWEY 2017. 

224 DDbDP 2016 1: http://papyri.info/ddbdp/ddbdp;2016;1. 

225 DDbDP 2016 2: http://papyri.info/ddbdp/ddbdp;2016;2. 

226 For discussion of the criteria behind this referencing system see below. 

227 It is possible in the DDbDP to add introduction and line-by-line 
comments to any published papyrus as well: cf. REGGIANI 2017, 241. 

228 The content of APIS has been integrated into Papyri.info. 

229 The layout of the edition may depend on browser settings. 

230 For discussion of the methodological advantages and the development 
of papyrological digital resources, cf. REGGIANI 2017, passim. 

231 The images are of course hosted by the Gothenburg University Library. 
The online publication of images not available on the instituional websites 
of their respective holders and copyright owners may be an additional 
challenge in other cases. 

232 If the whole article was not available online, the relevant entry in the 
online Bibliographie Papyrologique via http://papyri.info/bibliosearch was 
linked. 

233 This method is being applied on literary papyri in the Anagnosis 
project: http://www.kallimachos.de/kallimachos/index.php/Anagnosis (see 
the chapter by R. Ast and H. Essler in the present volume). 

234 For the usefulness of online paleographies cf. PapPal (http:// 
www.pappal.info/). For an online papyrological school see the online 
Arabic Papyrology School (http://www.apd.gwi.uni-muenchen.de:8080/aps/ 
home/), which uses the method of matching letter forms with the 
transcription in order to introduce students to the paleography of Arabic 
papyri. 

235 http://papyri.info/docs/checklist. See REGGIANI 2017, 23 ff. and 29 ff. 
on the Checklist and bibliographical standards in Papyrology. 

236 Cf. REGGIANI 2017, 241. 


237 It has not been decided yet whether these publications will be included 
in the Sammelbuch or not. 

238 Edited by R. AsT, L. BERKES, and J. M.S. CoWEy: http://www.uni- 
heidelberg.de/fakultaeten/philosophie/zaw/papy/projekt/bulletin.html 
239 On the other hand, producing a digital-only volume is certainly 
possible. 

240 However, as R. Ast pointed out to me, the situation was much worse in 
the late 1990s and early 2000s: the difference is that users’ expectations are 
much higher now. 

241 Cf. REGGIANI 2017, 250 ff. and the chapter by R. Ast and H. Essler in 
the present volume. 

242 In fact, the editorial histories of certain submissions at Papyri.info often 
contain discussion in French, German, Italian or even other languages. 

243 “Machines have not yet been devised which can cope with variations 
inherent in handwriting”, p. 71. Transkribus promises to find a solution to 
the problem of the automatic transcription (https://transkribus.eu/ 
Transkribus). For the automatic collation of transcriptions, see Collatex 
(https://collatex.net/about), the successors of the well-known Peter 
Robinson’s Collate, the equally well-known Wilhelm Ott’s TUSTEP (http:// 
www.tustep.uni-tuebingen.de/tustep_eng.html), and finally Juxta (http:// 
www.juxtasoftware.org). West’s ‘traditional’ critical edition of the Odyssey 
was recently published posthumously (WEST 2017). 

244 WEST 1973, 72, and see the example on p. 71. 

245 HAINSWORTH 1982, xxxiv. 

246 Texts transmitted by multiple witnesses and editions produced via the 
traditional, post-Lachmannian textual criticism. 

247 A project of P. Andorfer and Ks. Zaytseva of the Austrian Centre for 
Digital Humanities (ACDH, Vienna) and G. Franzini of the UCL Centre for 
Digital Humanities (UCLDH, London); for the data, see FRANZINI 2012-; for 
the web site, see FRANZINI — ANDORFER — ZAYTSEVA 2016-. 

248 See also the Catalogue of Scholarly Digital Editions, compiled by P. Sahle 
(http://www.digitale-edition.de). Neither the DIG-ED-CAT nor Sahle’s 
Catalogue register the Homer  Multitext — Project (http:// 
www.homermultitext.org). On the subject of the digital scholarly editions, 
see in general PIERAZZO 2015. 

249 DIG-ED-CAT ID 10, 24, 57, 58, 73, 86, 92, 93, 100, 163, 244. The ID 
17, the edition of the Euripides scholia managed by D.J. Mastronarde 
(http://euripidesscholia.org/EurSchHome.html), started in 2007 and 
currently updated in November 2017, is not included in the catalogue of 
the digital scholarly editions not because this edition is not digital, but 
because it is credited not to be a scholarly edition, term by which the DIG- 
ED-CAT project managers “mean editions with a strong critical component” 
(https://dig-ed-cat.acdh.oeaw.ac.at/faq). I am not able to understand why 
the Euripides scholia are not a ‘scholarly’ edition, but an edition like 


Sappho’s poems (http://inamidst.com/stuff/sappho), “an attempt to collect 
Sappho’s entire work together in one page — with Greek originals, succinct 
[?] translations, and commentary [in fact, absent]”, it is. Obviously, these 
catalogues cannot provide any guarantee of completeness. 

250 ID 24 (IOSPE = Inscriptions of the Northern Black Sea — http:// 
iospe.kcl.ac.uk/corpus/index.html),86 (IGCR = Inscriptiones Graecae in 
Croatia repertae — http://www.ffzg.unizg.hr/klafil/dokuwiki/doku.php/ 
z:epidoc-hrvatska), 92 (InsAph = Inscriptions of Aphrodisias Project — http:// 
insaph.kcl.ac.uk/index.html), 93 (IRCyr = Inscriptions of Roman Cyrenaica — 
https://ircyr.kcl.ac.uk). For the s.c. “special types of edition”, i.e. the 
edition of papyri and inscriptions, see WEST 1973, 94-5. 


251 ID 57 (Derveni Papyrus —- https://chs.harvard.edu/CHS/article/ 
display/5418). 

252 ID 10 (Codex Sinaiticus - http://codexsinaiticus.org/en). The DIG-ED- 
CAT does not include a digital scholarly edition very similar to the Codex 
Sinaiticus, that is Palamedes (PALimpsestorum Aetatis Mediae EDitiones Et 
Studia — http://www.palamedes.uni-goettingen.de), the edition of the 
palimpsest mss. Hierosol. Sancti Sepulcri 36 and Par. Gr. 1330. 

253 ID 244 (http://digitalathenaeus.org). The Digital Athenaeus is a “work 
[...] focused on annotating quotations and text reuses in the Deipnosophists 
in order to [...] provide an inventory of authors and works cited by 
Athenaeus, and to implement a data model for identifying, analyzing, and 
citing uniquely instances of text reuse in the Deipnosophists”, where the 
Greek text is the digitization of the Teubner edition of KAIBEL 1887-90 
without critical apparatus (see also BERTI - DANIELS — STRICKLAND — 
VINCENT-DOBBINS 2016; BERTI — BLACKWELL — DANIELS — STRICKLAND — 
VINCENT-DOBBINS 2016). S.D. Olson is at work, in order to produce a new, 
‘traditional’ critical text of the Deipnosophists (the first volume of this 
edition will be published in 2018). 

254 Malaspina in MALASPINA — DELLA CALCE 2017, 58-9: “Con la formula 
‘edizione digitale’ oggi si intende praticamente di tutto: riproduzioni di 
epigrafi, scansioni di brogliacci, collazioni di varianti e cosi via. Anche se si 
aggiunge l’aggettivo ‘critica’, che nella filologia classica tradizionale 
indicherebbe un prodotto ben definito, non si ottiene un quadro pit 
omogeneo e soprattutto si vede spesso gabellato per ‘critico’ cio che 
sarebbe pit onesto definire ‘diplomatico’, ovvero la mera trascrizione di un 
testimone e/o la giustapposizione di varianti senza nemmeno porsi il 
problema della vera lectio”. With reference to the Romance studies, Rinoldi 
in BERNAGOU — PALUMBO — RINOLDI 2016, 41-4 underlines some 
consequences of the increasing online diffusion of the ‘virtual manuscripts’, 
if compared to the lack of digital critical editions: even though the 
application of this approach should be ideally restricted to documentary 
texts, texts transmitted by a single ms., and particularly venerable mss., the 
alignment of reproductions and transcripts of mss. could promote the 
return of the bad Bédierism, that is the fetish of the bon manuscrit (see 
below). See PIERAZZO 2011 on the digital scholarly edition of documentary 
texts. 

255 Malaspina in MALASPINA — DELLA CALCE 2017, 58-62. 

256 See also Tomasi in ITALIA — TOMASI 2014, 120, noticing the need of 
shared criteria such as verifiability of the sources, reliability of the 
institution promoting the edition, presence of curators, scientific layout, 
dates of creation and updating, absence of commercial interference (and, 
hopefully, absence of mistakes). In her opinion, to the absence of shared 
criteria, it is to be added that the DH are sometimes perceived by 
philologists as a mere instrument useful for speeding up procedures rather 


than as a new way of understanding the edition, and that the interface 
often hides the methodological accuracy that has governed the process of 
creating the edition. 

257 See e.g. D’IORIO 2010; Italia in ITALIA - TOMASI 2014, 128-30, 
especially on the analytic genetic editions. 

258 See MONELLA 2012, n. 35, with bibliography, reconsidering “the 
historical and anthropological/ethnological foundations of the discipline”. 
259 See e.g. BOSCHETTI 2009, 5. 

260 In the update of November 2017 Mastronarde announces the online 
publication of the edition of the scholia on Orestes 1-500 and, in the 
meantime, the forthcoming open-access publication of the Preliminary 
Studies on the Scholia to Euripides. The page about the 136 sigla used for 
Euripidean manuscripts is new, with the possibility to download them as 
Excel spreadsheet (EurSiglaTable.html), together with an updated 
‘Manuscripts page’ with additions. 

261 The description of the manuscripts is the only part that has been 
substantially updated (in 2016: https://euripidesscholia.org/ 
EurSchMSS.html; in 2017: https://euripidesscholia.org/EurSchMSSnew. 
html). Also, Mastronarde has been able to improve Schwartz’s collations of 
M, B, and V; a greater progress has been made adding some lemmas and 
glosses of C, all the scholia in H (the Jerusalem palimpsest, for which see 
supra, n. 10), ms. unknown to Schwartz, and those in O (Schwartz wrongly 
dated the ms. to 15th century, but it is now credited to be written ca. in 
1175). Mastronarde does not include for the moment the ancient 
manuscripts with marginal and interlinear notations, for which see 
MCNAMEE 2007, 253-7. 

262 Not always the XML method is accepted: Schmidt’s Ecdosis (SCHMIDT 
2016, 100-1), a back-to-front development model providing a user-driven 
framework, aims to create, publish and share digital scholarly editions 
without using XML, seriously affected by “the problem of ‘markup 
variability’ - the tendency by different encoders to mark up the same 
features in different ways”. So, “instead of the linear text of XML, with 
embedded tags designed to apply abstract formats, Ecdosis uses a non-linear 
text and separate markup to describe textual properties, which are not 
arranged hierarchically, as in XML, but are allowed to overlap”. For the TEI 
encoding limits, see CUFALO — MUGGITTU 2016, 89-91; in many of his 
contributions Schmidt underlines the structural inadequacy of the 
embedded generalized markup for cultural heritage texts (see e.g. SCHMIDT 
2010), and it is no coincidence that XML / TEI are not always adopted for 
these purposes. 

263 See at https://euripidesscholia.org/EurSchStructure.html. 

264 See at https://euripidesscholia.org/EurSiglaTable.html for the different 
sigla adopted by the modern editors of scholia. 

265 Vatican, Biblioteca Apostolica Vaticana, gr. 909, ca. 1250-1280. 


266 SCHWARTZ 1887, viii. 

267 Veteres and witnesses before ca. 1261, recentiores (witnesses after ca. 
1261) containing old scholia, witnesses for Moschopulean scholia on the 
Triad plays, witnesses for Thoman scholia, witnesses for Triclinian scholia, 
witnesses for Miscellaneous Palaeologan scholia. 

268 HARRIS 2014, 84-5. 

269 http://www.homermultitext.org/about.html; see also DUE — EBBOTT 
2010, 154: “unlike the standard format of printed editions, which intend to 
offer a reconstruction of an original text as it supposedly existed at the time 
and place of its origin, the Homer Multitext offers the tools for discovering, 
viewing, and understanding a variety of texts as they existed in a variety of 
time and places”. 

270 See DUE — EBBOTT 2010, 153 and already DUE — EBBOTT 2009, 5: 
“textual criticism as practiced is predicated on selection and ‘correction’ as 
it creates the fiction of a singular text. The digital criticism we are 
proposing for the Homer Multitext maintains the integrity of each witness 
to allow for continual and dynamic comparison, better reflecting the 
multiplicity of the textual record and of the oral tradition”. 

271 The ‘fluidity’ of the Homeric text is in accordance with the general 
concept of the digital text; see BABEU 2011, ix: “As recently as a generation 
ago, the ‘text’ in classics was most often defined as a definitive edition, a 
printed artifact that was by nature static, usually edited by a single scholar, 
and representing a compilation and collation of several extant variations. 
Today, through the power and fluidity of digital tools, a text can mean 
something very different: there may be no canonical artifact, but instead a 
data set of its many variations, with none accorded primacy”. 

272 Venezia, Biblioteca Nazionale Marciana gr. Z. 454 (coll. 0822, the 
famous Venetus A); Venezia, Biblioteca Nazionale Marciana gr. Z. 453 (coll. 
0821 = Venetus B); Venezia, Biblioteca Nazionale Marciana gr. Z. 458 (coll. 
0841 = Allen U4); Escorial, Real Biblioteca fonds principal y. I. 01 (Andrés 
294 = Allen E3 = West E); Escorial, Real Biblioteca fonds principal Q. I. 
12 (Andrés 513 = Allen E4 = West F); Genéve, Bibliothéque de Genéve 
fonds principal gr. 44. Only the Veneti A and B are completely digitized; we 
have also a sample of the Genav., while the images of both Scor. are 
currently unavailable. 

273 The Bankes Papyrus (P.Lond.Lit. 28 = TM 60500 = M-P3 1013 = 
LDAB 1623 = Allen-Sutton-West 14); see also https://www.bl.uk/ 
collection-items/the-bankes-homer. 

274 DUE — EBBOTT 2010; BIRD 2010. For the following presentation of these 
books I am partially indebted with the dissertation of MARTINI 2013, 32-45. 
I take this opportunity to remember the very fruitful collaboration with 
Isabella Andorlini in the supervision of Martini’s work. 

275 E.g. WEST 1967. 

276 BIRD 2010, 34-6; see also DUE — EBBOTT 2010, 8: “where different 


written versions record different words, but each phrase or line is 
metrically and contextually sound, we must not necessarily consider one 
‘correct’ or ‘Homer’ and the other a ‘mistake’ or an ‘interpolation’”. 

277 NAGY 1996, 113. Already DI LUZIO 1969, 151 proposed to insert in the 
margin of the critical text the equivalent variations obtained from the 
papyri, so that the reader could choose the lesson according to his ‘taste’; 
and the pleasure of choice would not be reserved only for the learned 
editor. 

278 See BIRD 2010, 34-7. 

279 It is the case, probably, of Il. VIII 526, discussed by BIRD 2010, 57-8, 
but see especially his n. 151. 

280 BIRD 2010, 92-6. 

281 BIRD 2010, 95. 

282 See BIRD 2010, 96-100. 

283 See especially ch. 4, Iliad 10. A Multitextual Approach (pp. 153-66), and 
parts II and II (Texts and Commentary, pp. 169-382). 

284 DUE — EBBOTT 2010, 153. 

285 DUE — EBBOTT 2009, 7: “our textual criticism of Homeric epic, then, 
needs to distinguish what may genuinely be copying mistake and what are 
performance multiforms”. 

286 DUE — EBBOTT 2009, 5. 

287 VANHOUTTE 2007, 165-6. 

288 DUE — EBBOTT 2009, 2. 

289 See also DUE — EBBOTT 2009, 33: “as we continue to build up our 
collection of texts, there are still questions to be answered about how to 
construct the architecture to achieve the visual representation we envision 
and that will achieve the results we have described here”. 

290 Three magic words in the digital editing. Their relevance is out of the 
question, but often editors do not go beyond the simple enunciation. 

291 WEST 2001, 160 n. 5. 

292 P.Mich. inv. 6972 (TM 61210 = M-P3 864.1 = LDAB 2350 = Allen- 
Sutton-West p609); P.Berol. inv. 11911 + 17038 + 17048 + 21155 (TM 
60757 = M-P3 855.1 = LDAB 1883 = Allen-Sutton-West p425 + 430); 
P.Cair.Masp. inv. 67172-4 + P.Berol. inv. 10570 + P.Strasb. inv. G 1654 
+ P.Rein. II 20 (TM 61072 = M-P3 658 = LDAB 2209 = Allen-Sutton- 
West p46). 

293 So, e.g., the commentary to Il. X 10 (p. 247). 

294 E.g., the examples drawn from the Platonic paradosis by CUFALO — 
MUGGITTU 2016, 92, in order to illustrate the relevance of a more 
comprehensive recensio, have nothing to do with the IT tools. 

295 Cf. WAGNER — OUTHWAITE — BEINHOFF 2013, 4. 

296 http://papyri.info/. 

297 A very clear textbook on linguistic annotation and corpus linguistics in 
general is KUBLER — ZINMEISTER 2015. See also e.g. WYNNE 2005 on 


developing linguistic corpora. 

298 CELANO 2017. 

299 https://sematia.hum.helsinki.fi. I warmly thank the developer, Erik 
Henriksson, for all his ideas and efforts. 

300 SOSIN 2010, cf. also REGGIANI 2017, 232-40. 


301 This type of alarm system has not yet been established, but it is on our 
agenda. 

302 https://perseusdl.github.io/treebank_data; BAMMAN — CRANE 2011. 

303 https://sourceforge.net/p/epidoc/wiki/Home. 

304 VIERROS — HENRIKSSON 2017. 

305 Both treebanks have also been modified for the Universal 
Dependencies site (http://universaldependencies.org), where they can be 
accessed together with many other languages. 

306 Version 1.1: BAMMAN —- CRANE 2008, version 2.0: CELANO 2014. 
Version 2.0 is to be followed, but version 1.1 has sometimes more useful 
examples and more detailed explanations. 

307 http://sites.tufts.edu/perseids. 

308 There are fields for the description of the handwriting in the edition or 
some other scholarly source, the description of the handwriting by the 
annotator, and the “Same hand” field, i.e. list of other documents, where 
the same hand is said to appear. These fields are text-based, and thus they 
do not provide good searchable data. Every papyrologist is also well aware 
of the lack of precision of these descriptions in different editions. 

309 For each person the annotator can add the name, title and the 
Trismegistos People ID (http://www.trismegistos.org/ref/index.php). 

310 In its current state, the field is not very usable, user-friendly or 
accurate; the list of other documents where the same hand appears is done 
in stable URLs of the documents in PN, but one document can contain 
several hands. 

311 BULOW-JACOBSEN 1997, 69. 

312 E.g. SETS Treebank Search, PML Tree Query Engine or XQuery/Basex, cf. 
VIERROS — HENRIKSSON 2017, 13. 

313 One thorough treebank-based study on ancient languages is 
KORKIAKANGAS 2016, in which the author has been able to study under 
which conditions the Latin accusative began to be used as the subject case 
in VII and IX centuries. 

314 https://sematia.hum.helsinki.fi/tools. 

315 See, however, DAHLGREN 2017, 90 ff. on phonological variation of /o, 
u/ possibly playing a role in case variation. 

316 VIERROS — HENRIKSSON 2017, 13. 

317 Cf. HORROCKS 2010, 109-10 and 143-4 on the developments of past- 
tense morphology. 

318 All three in O.Claud. II 236. 

319 The project “Digital Grammar of Greek Documentary Papyri” (ERC 
Starting Grant 2017 no. 758481) will use and develop these tools. 

320 E.g. Mahaffy in P.Petr. I 12 (1891), n. tol. 15 

321 P.Lond. I (1893), p. vi. 

322 VAN MINNEN 1993, 5-7. 

323 The addition of sic to unconventional language, as referred to in 


P.Lond. I (see quote above), is also found in the early BGU editions, next to 
the regularizations in the apparatus. For example, in BGU II 451 we find 
TaxELoV, lL. TaxLov (1. 11), domtioeoGat with sic above e (1. 9) and dona| 
gOueda o€ with sic above o€ (ll. 11-12). This is a good example of the 
challenges faced during digitization of these older editions. All three were 
initially entered into the DDbDP as regularizations in the apparatus (L. 
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129.... TOV | Va... Kai yyavat nepLi] 
etn: vdpeiav (Ll. Vdpiav), mapgoxev Av|Ti TOD ogvyyeiov (L 
Ogvyylov) KOAAOUpinv Vdpei|[,,. Jav (Ll Vspiav): “we had all the other 
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texts. In Zonar. 6 524,27 Tittmann s.v. 5igSptov the term KaOESplov is used 
as the interpretamentum of 81€S5ptov, a compound of &pa meaning a “seat 
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Ka0edpiwv appears to be the explanation of Th. 454-5 nwAuK@v / 0’ 
€5wAiwv. Thus, the plural of kabgSplov — as well as of KaOESpa — seems to 
acquire the extended semantic value of “bower, abode”, that is one of the 
meanings of the plural £5wWAta (cf. LSJ9 478 s.v.), even though the second 
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449 The only available image of this papyrus is black and white and is 
stored in the photographic archive of CEDOPAL (University of Liége). 
Under my request Prof. Marganne, to whom I express all my gratitude, 
provided me with the available image. Then, in order to have the best 
image possible to check the text, I purchased high-quality images of the 
recto and the verso of the Rylands papyrus (format: large TIFF 600 dpi) 
from the Centre for Heritage Imaging and Collection Care (CHICC) of the 
University of Manchester Library. I would like to take the opportunity to 
thank John Hodgson, Manuscript and Archives Manager, and Tony 
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